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Approximate  and  local  linearizability  of  non-linear  discrete-time 
systems 


HONG-GI  LEEt  and  STEVEN  I.  MARCUSt 
We  consider  a  single-input  non-linear  discrete-time  system  of  the  form 
I:  x(i+l)  =  /(x<r),u(t)) 

where  xeR",  ueR,  and  /(x,u):  Rv* 1  -»R"  is  a  C®  Revalued  function.  Necessary 
and  sufficient  conditions  for  approximate  linearizability  are  given  for  2.  We  also  give 
necessary  and  sufficient  conditions  for  local  linearizability.  Finally,  we  present 
analogous  results  for  multi-input  non-linear  discrete-time  systems. 


1.  Introduction 

We  consider  a  single-input  non-linear  discrete-time  system  of  the  form 


I:  x(f  +  l)  =  /(x(f),u(r)) 


(1) 


where  xe  R*  ue  R,  and  /(x,  u)  :  Rw  + 1  -» R*  is  a  C®  Revalued  function. 

Many  authors  have  studied  (local  or  global)  linearization  (Cheng  et  al.  198S,  Hunt 
and  Su  1981,  Jakubczyk  and  Respondek  1980,  Kroner  1973,  Su  1982)  and  approxi¬ 
mate  linearization  (Kroner  1984)  by  state  feedback  and  coordinate  change  for  non¬ 
linear  continuous-time  systems.  In  this  paper  we  discuss  necessary  conditions  and 
sufficient  conditions  for  local  linearization  and  approximate  linearization  by  state 
feedback  and  coordinate  change  for  non-linear  discrete-time  systems.  A  necessary  and 
sufficient  condition  for  local  linearization  has  recently  been  found  by  Grizzle  (1985  c); 
a  result  equivalent  to  this  is  proved  in  our  Theorem  5.  These  conditions  arc  very 
similar  to  those  available  for  continuous-time  systems,  but  they  are  more  difficult  to 
calculate  than  our  sufficient  condition  in  Theorem  4.  Other  related  work  on  non¬ 
linear  discrete-time  systems  can  be  found  in  Grizzle  (1985  a,  b).  Grizzle  and  Nijmeijer 
(1985),  Monaco  and  Normand-Cyrot  (1983  a,  b,  1984). 


A 
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r 
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Definition  1  •  I 

A  point  (x«,  u,)  such  that  /(x„  u.)  =  x,  is  called  an  equilibrium  point. 


Now  consider  the  following  linear  discrete-time  system  L0: 


I0:  y{t  -t-  1 )  =  Ay(t)  +  bv(t)  =  g(  y(t),  u(r)) 
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where 


0  1  0 
0  0  1 
0  0  0 


0  0  0 

01 


0 


0 

1 

0 


(N  x  N  matrix) 


0 

1 


(N  x  1  matrix) 


Similarly  to  the  continuous-time  case  (Krener  1984,  Su  1982),  we  can  define  local 
linearizability  and  approximate  linearizability  for  a  discrete-time  system.  Let  (xe,  ut) 
be  an  equilibrium  point  of  £. 


Definition  2 

I  is  said  to  be  locally  linearizable  at  (xt,  ut )  if  there  exist  an  open  neighbourhood 
U  (cRN+l)of  (xe,  ue)  and  a  diffeomorphism  T:  U -»  T(U)  such  that 


(i)  T=(TU  T2, Tn )  are  functions  of  x„  x2 . xN  only, 

(ii)  r(xe,ue)  =  0(JV+l)Kl, 

(iii)  T°f  =  g°T. 


If  we  let  (y(t)Tr(r))T  =  T(x(t),  u(t))  then  y{t)  and  v(t)  satisfy  E0-  Definition  2  indicates 
that  we  want  to  find  a  diffeomorphism  T  such  that  the  following  diagram  commutes: 


R", _ l _ R*  + 1 


R% - - - RN  + ' 

Once  we  find  such  a  diffeomorphism,  we  can  apply  linear  system  theory  instead  of 
non-linear  system  theory. 


Definition  3 

£  is  said  to  be  approximately  linearizable  with  order  p  if  there  exist  an  open 
neighbourhood  U  (c  R*  +  I)  of  (x„  ut )  and  a  diffeomorphism  T:  V  -» T(U)  such  that 

(i)  T = (T„  T2, ....  Tn)  are  functions  of  x,,  x2, xN  only, 

(ii)  T(xe,ue)  =  0(N  +  l)xl,  and 

(iii)  T°f  =  g°  7  +  0(x  -  x„  u  -  «,)p+1. 

Thus  in  Definition  3  we  consider  the  following  nearly  linear  discrete-time  system: 
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where  the  N  x  N  matrix  A  and  N  x  1  matrix  b  are  the  same  as  I0.  Clearly,  local 
linearizability  at  (xt,  u,)  implies  approximate  linearizability  with  arbitrary  order. 

In  §  2  some  background  material  is  reviewed  and  notation  is  defined.  In  §  3 
necessary  and  sufficient  conditions  for  approximate  linearizability  will  be  given  for  the 
system  (1).  Also,  we  shall  give  necessary  and  sufficient  conditions  for  local  linear¬ 
izability.  We  can  define  local  linearizability  and  approximate  linearizability  for  multi¬ 
input  discrete-time  systems  similarly  to  Definitions  2  and  3.  Then  the  multi-input  case 
will  be  discussed  in  §  4. 


2.  Preliminaries 

In  this  section  notations  and  definitions  to  be  used  later  will  be  mentioned.  The 
Kronecker  product  is  very  useful  in  the  field  of  matrix  calculus  (Graham  1981).  First, 
define  the  Kronecker  product  ®  by 


A  ®  B 

fj  x  q  irxh 


B 

auB 

<>21 

B 

a22B  .. 

auB 

_aPi 

B 

OpzB 

a„B 

where  atj  is  the  (i,  j)-component  of  the  p  x  q  matrix  A. 

Define  the  derivative  of  a  matrix  with  respect  to  a  matrix  by 


We  also  define 


d 

P  „ 

P  1 

- — B 

- — B  ... 

- — B 

&»n 

<U|2 

? 

p 

P 

- B 

- B  ... 

- B 

da2l 

II 

rs 

•  ' 

da2q 

P 

P 

P 

- B 

- B  ... 

- B 

dapl 

dap  2 

J 

D°aB 

=  B 

D\B 

=  DaB 

D‘flB 

=  Da(D'aB) 

for  i  ^ 

Let  h(x)  be  a  scalar  real-valued  function  of  xeR*.  Then  (£^/i)(x)  and  (Dkjh)(x)  are 
Nk  x  1  and  1  x  Nk  vectors  respectively. 


Fact  (Vetter  1970,  1971) 

Using  the  definition  of  Kronecker  product  and  derivative  operations  on  matrices, 
Taylor’s  formula  can  be  expressed  by 

h(x)  =  h( 0)+  £  ^(D‘,Tfi(x)),  =  0(*<8>*®  ®x)+RJ ♦,(.*•) 

where  R,+  1(x*)  is  a  remainder  term. 
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Now  define  the  Nk  x  Nk  permutation  matrix  UilJ2 as  follows:  the 

<(a4l  -  1)N‘_I  +(ah  -  1  )N*'2  +  ...  +  (£(„.,  -  l)N  +  ai(i)th  column  of  Utl  h . ik  is  the 

((a,  -  1)N*~ 1  +(a2  -  1  )Nk~2  +  ...  +(a4_ ,  -  1  )N  +  a„)th  column  of  the  Nk  x  Nk  iden¬ 
tity  matrix  (INkxNk)  for  1  <  a,,  a2, ....  ak  <  N  (the  {a(}  are  related  to  the  ‘base  N’ 
representation  of  the  column).  Here  {i,,  i2, ....  it}  is  a  permutation  of  {1, 2, ...,  k}.  For 
example,  when  N  =  2  and  k  =  3 

^123  ~ 


10  0  0  0  0  0  0 
0  0  0  0  10  0  0 


^321  — 


0  0  1  0  0  0  0  0 
0  0  0  0  0  0  1  0 
0  10  0  0  0  0  0 
0  0  0  0  0  10  0 
0  0  0  10  0  0  0 
0  0  0  0  0  0  0  1 


Let  A  be  a  p  x  Nk  matrix.  Define  the  operator  ©  by 


(BA  =  A 


I  all  permutations 

\[i t.h . ik)  of  {1.2 . *} 


For  example,  when  A  is  a  p  x  Ni  matrix 


®A  —  >4(1/,  23  +  1/,  32  +  ^213+^23 1  +  ^312  +  ^32 1) 


i(dA  idA  (A . (z-rm  i 

(\5u/(0_o,  \dxJ(0,0\du/{0'0)  \dxj(0.0)\duj{0.0)) 

be  linearly  independent;  that  is,  they  form  a  basis  for  RN.  Define  f:RN-+R  by 
C(t>)  =  <xN,  where  v  is  a  1  x  N  row  vector  and 


T  V  (df\ 

v  -  L  «f{  t  )  t~  ) 

(  =  i  \tfx/,0i0)V?u/(0-0 


That  is,  ((d)  is  the  last  coefficient  of  vT  with  respect  to  the  basis  {w,.  w2, ....  wN},  where 
w,  =  {df /dx)[oQ)(df /du)(0,0),  1  Also  define  R'”'*-* Rp  by 


f(K)  = 


where  r,  is  the  ith  row  of  V. 
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3.  Single-input  case 

In  t his  section  our  main  results  will  be  given.  If  /(v.  u)  lias  an  equilibrium  point, 
without  loss  of  generality,  we  can  assume  that  /  (0.  0)  =  0;  for.  if  not.  let  v  =  v  v,  and 
i'i  =  ii  II,..  Then  \(f  +  I)  =  A  v(i).  ii(i))  i  / ( \  +  v...  li  +  u,l  \,  with  MO.  0)  =  0. 

Let 


/"'(  v.  ii )  —  /(v.  II) 

!  '  "(v.  ii)  =  /|/\\.  ii).  0)  for  I  $  /  si  N  I 

./  '(.v,  u)  represents  the  effect  of  an  input  1 1  at  I  =0  on  the  state  at  i  =  i.  /  (v.  ii)  is 
essential  for  solving  many  problems  arising  in  discrete-time  non-linear  systems. 


Lemma  I 

1  is  locally  lineari/able  at  (0.0)  if  and  only  if  there  exists  a  C‘  function 
h:  H'(c  [R')-*IFn  such  that 

(i)  M  is  an  open  neighbourhood  of  Oe  IRV 

(ii)  /)„(/i  l  ‘)  =  0  on  some  neighbourhood  of  0e  1RV  ’  1  for  I  <  i  ^  /V  —  I 


(iii)  det 


/i'i  A 

\<  •'  '  \  o 


<\h  f) 

r.v 


(O.OI 


<\h  r  •) 


(O.oi 


¥■  0 


(iv)  (D„(/i  /  ))((>,(>!  ^  0 


(v)  /i(0)  =  0 


Proof 

Necessity.  Suppose  that  I  is  locally  lincarizablc.  Then  we  have  a  diffcomorphism 
T.  Let  /»( v )  =  r,(.v).  (Since  T,(v.  id  depends  only  on  v.  we  can  write  T,(v)  instead  of 
T,(  v.  ii).)  Note  that  T2  =  T,  f.  Since  Dy(T2)  =  0  on  some  neighbourhood  of  the  origin. 
DU(T,  ./)  =  0  on  some  neighbourhood  of  the  origin.  From  now  on,  for  convenience, 
we  shall  omit  ‘on  some  neighbourhood  of  the  origin'  Note  that  T,  =  T;  /  =  T,  f2. 
(Actually,  we  can  write  T,  =  T,  / 2,  because  T,  /  depends  only  on  v.  But  f2  is  used, 
for  consistency  of  notation.)  Since  D„(T,)  =  0.  D„(T,  ■•/*)  =  0.  Proceeding  in  this 
manner,  since  Ts  =  T„  t  f  =  ...  =  fN  1  and  Dy{TN)  =  0.  D„(T,  Is  1 )  =  0.  Thus 
we  have  shown  that  DU(T,  /')  =  0,  for  I  I.  Since  T  is  a  diffeomorphism. 
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7j  =  T,  °/‘  1  for  2  ^  i  <  N  +  1,  and  T,,  T2, ...,  TN  depend  only  on  x, 


and  Du(h « /")  #  0.  Since  T,(0, 0)  =  0,  /i(0)  =  0. 

Sufficiency.  Suppose  that  there  exists  h :  R*  -* IR  satisfying  the  given  conditions.  Let 
7I(x)  =  for  1  < / < N  +  1.  Then  it  can  be  easily  checked  that  T°f  =  g °  T and 

T(0, 0)  =  0.  Since  det  {(dT/d(x,  u))(0  0))  #  0  there  exists  an  open  neighbourhood  V  of 
(0, 0)  such  that  T:  U  -*  1\  U)  is  a  diffeomorphism  by  the  in  verse  function  theorem.  □ 


Lemma  2 

I  is  approximately  linearizable  with  order  p  if  and  only  if  there  exists  a  C(iv) (v) * * * * x  function 
h :  W(  e  Rw)  -►  R  such  that 

(i)  kV  is  an  open  neighbourhood  of  Oe  R* 

(ii)  (D^D.ih  °f% o.o,  =  0(W  +  lfJ  „ ,  for  1  <  i  sg  N  -  1  and  0  $  /  ^  p  -  1 


(iv)  (Du{h°f% o.o,  #0 

(v)  h(0)  =  0 


Proof 

Necessity.  Suppose  I  is  approximately  linearizable  with  order  p.  Let  h(x)  =  T,  (x). 

By  definition,  T,  °/(x,  u)  =  T2(x)  +  0(x,  uf  * 1 .  So  o/))(0.0>  =  0  for  0  <  p  -  1 . 

Note  that  T,  °f 2  =*T2°f  +  0(x,  uf*1.  Since  T2  °f(x,  u)  =  T3(x)  +  0(x,  uf* 1  by  defi¬ 
nition,  T, °/2  =  T}(x)  +  0(x, uf*'.  Thus  (DJlDu(h°f1)\OO)-0  for  0<ys$p-i. 

Proceeding  in  this  manner,  we  can  show  that  (fyDJh  o/'))<0.<»  =  0  for  1  <  i  ^  N  -  1 
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and  0  sS  /  $  />  -  I.  Note  that  (^Tj/Av),,,  =  [i\h  /'  1  )/Av)|0 for  I  ^  /  $  N.  Since 


Y'^3 

\  <"’V  /(O.OI 

1 

>» 

(■TA 

V  '‘X  /t(l.O) 

^  0,  det 

\  '2x  /(O.OI 

f'TA 

Ar<  7<o.o,_ 

/W /"  ‘A 

A  ^  A.O, 

It  can  easily  be  shown  that  7,  /  v(x, u)  =  7* ,  ,(.v,  u)  +  0(x,  uf' ' .  Thus  (DJh  /  *)),„  = 

(D„7V ,  ,(.v.  #  ().  Finally,  /i(0)  =  T,(0)  =  0. 


Sufficiency  [hy  construction).  Let 


T,(.v)  =  h(x) 

&  limes 

T2(X)  =  X,  ^y(£>*T(/»  /)),o.<n('c®  V®  ...  ®.v)' 

7i(x>=  Z  /2Vo,(x®.v®  ...  ®.v) 

7*(*)  =  X  /N  ,)><o...,<<®*®  ...  ®v) 

*  -  I  K ! 

Ts+xix,  u)  -  T,  fN(x.u ) 

Then  it  can  easily  be  checked  that  T  as  defined  above  satisfies  the  conditions  of 
Definition  3. 

Now  note  that 


where 


(D7D„(/i  f  !),«,„ 


X  1  K  -  O 

I  (I 


(2) 


B'm.n  =  D7(Dj')T 

s:.,  =  *f  X  -  X'o?  1+1  ‘■(D./iT®D‘'  k'(0./"®DT'  *') 

*  j  -  I  fc  j  -  I  -  l 

x  (D./,t®  ...  07/,'  1  *'(P. /iT®DV  'DjiJ,  .))  forl^/^m 


and 
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(For  a  proof  of  (2)  see  the  Appendix.)  Let 

^11  ^12 
I  A,,  A 

Ak  = 


22 


Aik 
A  2k 


Ak  i  A 


k  2 


’a 


where  Atj  is  an  (N  —  1)(A/  +  1)‘  x  Ni+1  submatrix  defined  by 


Ay  —  0<fv  -  i  )(N  + 1  )•  x  ni  ♦ 
(■®/y)(0.0» 


if  i  <  j 


A»  = 


0(0.0) 


(By  )(0,0! 


if  i  ^  j 


Let  the  (N  -  1  ){N  +  1)‘  x  1  vector  be  defined  by 

(B  i4o)(o.oi 


P‘=- 


(B*o)(0.0) 


(Bf, o  1  )(0.o» 


(See  §  2  for  the  definitions  of  ©  and  f )  Also,  let  pk  =  (PlT  p2 


With  these  preliminaries,  we  can  state  our  main  theorems. 


nT- 


Theorem  3 

I  is  approximately  linearizable  with  order  p  (>  2)  if  and  only  if 


m{(g)  m  m . (ff  \ 

l\'U/l  0.01  V -VIO.O)  V  *V<0.0)  V^X/IO.O)  V1 


(0.0) 


are  linearly  independent 


(ii)  P  ,  e Image Mp_  J 


Proof 


Necessity.  Suppose  that  L  is  approximately  linearizable  with  order  p.  Then  there 
exists  a  function  h(.x)  satisfying  (i)-(v)  of  Lemma  2;  in  particular. 


(D.(h  on,  =  ®  =0  for  1  $  i  $  N  -  1 
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1 1 1 1 


Assume  that 


/ 10.01 


are  not  linearly  independent.  Then  there  exists  k  such  that  I  ^  k  ^  N  -  I  and 

V/  Y  (> J\  V  /  <7  Y  (<V 


=  I  *  ( 1 ,y  Y 

j  <!  v  •Vi<'-oiV"Wio.oi 


for  some  constants  |a,!‘ 


Thus 


and 


>/(y 
Pu 


'  “Yf) 

A.»>  i  <i  V* /iii.oi  io. oi 


=  0 


V'  '  /{D.l'i  V  1  11  ) 

This  is  a  contradiction,  which  implies  that 

YA  (tj\  ( <T 


(0.01 


are  linearly  independent. 

Recall  that  </)„<>.  A),,,,,,  =  (DTI)Jh  =  0  for  m  =  0  and  l  ^  i  ^  N  -  I . 

Since 


/<0.0> 


are  linearly  independent.  (B./'L  „  is  uniquely  determined  up  to  a  constant  multiple  (i.e. 
(D./i),  _(,  =  ac.  where  the  scalar  a  (  #0)  is  arbitrary  and  the  N  x  I  column  vector  c  satis¬ 
fies  <  r(<7  r.vi;„  „ .0,  =  o  for  0  sj  i  s:  N  -  2,  and  cr(c//<'v|('1.0,l(<7  cu)<0  ()|  =  1 1. 

Now  by  (it)  and  (iv)  of  Lemma  2,  (D?/)„(/i  /')l,o.oi-  ^  f°r  I  $  m  ■$  t>  -  I  and 


s;  ,'V  - 

1.  f  rom  (2) 

we  obtain 

0  0 

0 

Hi  .0 

«f .. 

0  0 

0 

«T.n 

, 

0  0 

Hi,  0 

0 

0 

(D./i).  o 

— 

dV  1 
*>l  .<> 

«!.„ 

B'  , 1  0 

0 

<»X  „ 

mV  > 

*>2.0 

0 

«;  ,.o 

K  .. 

K  1.2 

■  K>  l.c  t 

o.V  1 

D/>  1  .0 

I1 

«;  ,*  a 

O/V  1 

\.p  1 

10.01 

<DX  o 

(3) 
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Since  (B| ,0\o.Mh)M. 0  =( D!  D./rU.o^i*)«  =  o  «  *C((/>! D./T)<o.o,) 


*!.o 
B]  o 

~  1 

1.0 


{D\h)x_0  =  afil 


(0.01 


Thus  the  right-hand  side  of  (3)  is  -  «[(/?’  )T  ...  (Pf ' 1  )T]T  =  -  a , .  It  follows  that 
_  ,  is  in  the  image  of  the  matrix  on  the  left-hand  side  of  (3).  However,  the  {D^h}  are 
constrained  because,  for  example,  d2h/dxfdxj  =  d2h/dxJdxi.  Hence  the  stronger 
condition  0P  e  Image  (Ap  )  holds,  as  is  proved  in  the  Appendix  in  Lemma  A.2. 

Sufficiency.  Suppose  that  (i)  and  (ii)  above  are  true.  By  (i)  there  exists  an  N  x  1 
vector  C{  such  that  CJ{(df /Sx)\00)(cf  /du)i00)  —  0  for  O^/^JV  —  2  and 
C](8f/6x$0:0\(df/du\0'0i  =  1.  By  (ii)  there  exist  C2,  C3, ....  C„  such  that 


2!^ 


Ap-i 


3! 


C3 


P! 


!  0 


where  C,  is  an  JV'  x  1  vector.  Let 

h(x)=  f  ...  ®x) 

i=i*’  i - - - ' 


(times 


Then  it  can  be  easily  checked  that  {[HDy(h  °/')), 0,o)  =  0  for  1  <  i  $  N  -  1  and 
O^j^p-  1.  Clearly  (Du(h°fN))(0'0)  =  (Dlxhjl=0(df/dx)N0~0\(df/du\00)=  1  #0. 

Now  assume  that 

(*)... 

\  dx  A o.o 

_\  Bx  /(o,o»_ 

Then  there  exists  k  such  that  1  ^  k  ^  N  —  1  and 


det 


=  0 


(O^°(0o„  =  for  some 
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Thus 


This  is  a  contraction,  which  implies  (iii)  of  Lemma  2.  Hence,  by  Lemma  2,  E  is 
approximately  linearizable  with  order  p.  □ 


Remark 

E  is  approximately  linearizable  with  order  1  if  and  only  if  (i)  of  Theorem  3  holds, 
just  as  in  the  continuous  case  (Krener  1984). 

Now  a  sufficient  condition  for  local  linearizability  is  given  in  the  following 
theorem. 

Theorem  4 

Suppose  that  /(x,  u)  of  E  is  an  analytic  Revalued  function.  E  is  locally  linearizable 
at  (0, 0)  if 

m  m  . (*)"'(*)  1 

l\*W<0.0)  V^/(0.0)\^/<0.0>  W<o.o>W<o.o,j 

are  linearly  independent, 

(ii)  there  exists  k  (<  oo)  such  that  ftespan  (Cf)  for  all  /  >  1,  where  Cf  is  composed 
of  the  first  k  columns  of  A,. 


Proof 

By  (i)  there  exists  an  Nxl  vector  c,  such  that  c{(<?//cbc);0 0)(c?//£u)(0.o>  =  0  f°r 

0sJis£N-2  and  c](df/dx^0~0\{df/du)l0_0)  =  L  By  (ii)  there  exist  c2, c3 . Cj  such 

that  j  <  oo  and 


At 


2\Cl 


3! 


=  -  fit  for  l^j 


where  c,  is  an  N'  x  1  vector.  Let 
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Then  it  can  easily  be  checked  that  (fXDJh  /‘))<0.o»  =  0  for  I  ^  i  sg  N  -  I  and  s  >  0. 
Since  both  h(x)  and  /*  are  analytic,  h  /'(x,  u)  are  analytic,  for  1  ^  i  $  N  —  1.  Thus 
D„(h  •/*)  s  0  for  1  ^  i  3$  N  -  1.  As  in  the  sufficiency  proof  in  Theorem  3,  it  can  be 
shown  that  h  satisfies  the  other  conditions  of  Lemma  1.  t  I 

it  is  easy  to  see  that  (ii)  of  Theorem  4  implies  (ii)  of  Theorem  3. 

Remark 

Conditions  (i)  and  (ii)  are  also  necessary  for  local  linearizability  at  (0, 0)  whenever 
f{x,  u)  is  polynomial  and  a  polynomial  T(x,  u)  is  sought. 

In  the  following  theorem  we  give  necessary  and  sufficient  conditions  for  local 
linearizability. 

Theorem  5 

E  is  locally  linearizable  at  (0, 0)  if  and  only  if 

(t)m  m  m  ...MT'm  i 

(.VU/<0.OI  \<'*/<0.0»  VWlO.O)  \^X/(0.0)\^U/  (O.O)j 

are  linearly  independent, 

(ii)  there  exists  an  open  neighbourhood  U  of  Oe  IR*  such  that  A,  =  /*( A0)  +  /,2(A0) 
+  ...  +  /j(A0)  are  well-defined  (-dimensional  involutive  distributions  on  C  fori 
<  ( ^  N  -  I,  where  A0  =  span  {d/du}. 

Proof 

Necessity.  For  condition  (i)  see  Theorem  3.  By  Lemma  1  there  exists  h(x)  such  that 

conditions  (i)-(v)  of  Lemma  1  are  satisfied.  Let  y,  =  h(x),  y2=h  °f(x,  u) . and  yN  = 

h'  fNX(x,  u).  Then  by  (ii)  and  (iii)  of  Lemma  1  we  can  choose  (y,(x),  y2(x)>  •  yw(x))as 
new  coordinates  on  an  open  neighbourhood  U  of  Oe  BN.  By  (ii)  and  (iv)  of  Lemma  1, 
/*(A0)  =  span  {rVdyjy}'  Similarly,  /*(A0)  +  /„2(A0)  =  span  {d/dyN  t and  by 
induction,  AN_,=span  {d/dy2,d/Pyi,...,d/dyN}.  Hence  A*.,  is  an  involutive 
distribution. 

Sufficiency.  By  Frobenius’  Theorem  there  exists  a  Cx  function  h:<RN  -*  R  such  that 
A*  _  ((h)  s  0  and  (ct/i/dx)x=o  #  0-  Therefore  /j(A0)(h)  =  0  for  1  <  i  <  N  -  1.  By  (i),  since 
(c‘h/dx)x  =  0  ^  0,  (?/c'u)(h  °fN(x,  u))l0  0)  ^  o.  It  is  easy  to  see  that  h(x)  satisfies  the 
conditions  of  Lemma  1.  □ 

Remarks 

(a)  Condition  (ii)  of  Theorem  5  can  be  replaced  by  (ii)'  ker /„  +  (jr  ~  '/*)‘(A0)  is 
involutive  for  0<i</V-2,  where  re(x,  u)  =  x,  (rc,  '/*)0(A0) s  A0,  and 
(%lf*y<Ao)s(n*lf»)((x*lf*y~'(Ao))  for  j ^  I  (for  this  see  Grizzle  1985c, 
Lemma  2.1). 

(h)  A  more  geometric  necessary  and  sufficient  condition  for  approximate  lineari¬ 
zability  of  order  p  can  also  be  obtained  (Lee  1986);  however,  the  conditions  of 
Theorem  3  are  much  easier  to  check. 


Lmeari:abilm  of  non  /in«.o  Jn,  >.  '<  hi. 

Example  1 

Consider  the  following  discrete-time  non  mi,,  .o 
\  1 1  /  +  1 1 
\  ;(t  i-  1 1 
,  0  , 

Since  |<  /  i  u),„  „.  =  |  ^  )  and  i,  ' 

Note  that 

/('  ji/>  /> 

and  //'  =  0,  .  ,  for  i  »  - 

(B|,t . =  iO;f 'l .  ««/>.' 


0  II  i|  ' 

o  i  i  «•  j 

o  o  ii  :  | 

It  can  be  easil>  checked  that  all  elements  of  the  4th  column  ol  t  ,oe  n  tor 
Thus  /f,espan  (C*l  for  /  >  1  where  C*  is  composed  of  the  tirsi  four  columns  ot  4 
Therefore  nil  of  Theorem  4  is  also  satisfied  Hence  L  is  locallv  lineari/able  at  ill.  Hi 
Actually  we  can  construct  a  diffeomorphism  7=tT,.  7  7,1  in  the  wa>  that  is  given  in 
the  proof  of  Theorem  4.  Since 


v]  =  (0  0  0  -2).  Clearly  c}  =  (I  0).  Thus 

Tl(x)  =  xl-^x22  =  xl-xl 

T2(x)  =  7,  fix.  u)  =  x2  -  xf 
T,(\,  u)  =  T,  f2(x.  it)  =  x.  +  u-  (  Vs  +  2.v,i<  +  tr  )2 
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h-v 

Example  2 

Consider 

,v\S 

I: 

■ 

x,(f  +  1) 

X2(f)  +  2x1(f)u(f)  +  x,(t)2u(t)  +  u(f)2 

_x2(f  +  1)J  ' 

|_  x,(t)  +  u(t) 

=  /(x(t),u(f)) 


Clearly  (i)  of  Theorem  3  is  satisfied,  because  (?f/c'u){0  0)  and  (df/dx)i0  0)(df  /du\00 
the  same  as  in  Example  1.  Since 


<D!*>./TW  o,= 


2  0 
0  0 


Since 


0.0,  = 


^‘=d(D.D./T)(00l)  = 


0  0 
0  0 


0  0 
0  0 


0  0 
0  0 


0  0 
0  0 


p2  =  (2  0  0  0  0  0  0  0  0)T 


Also  we  have 


0 

0 

0 

2 

A  i  — 

0 

1 

1 

0 

0 

0 

0 

2  _ 

(&. 22)10.0 1 = 1 

[D;J  T)(o.oi  ®  (DfJ  T)(o,oi  ®  (Duf 

0 

r 

~0 

r 

= 

1 

0 

® 

1 

0 

®(0 

1) 

0 

1 

0 

1 

~0 

0 

0 

0 

0 

0 

0 

1  ~ 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

= 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

.0 

0 

0 

0 

0 

0 

0 

I . 

”  M  A 
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Since  e  Image  (.4, ),  I  is  approximately  linearizable  with  p  =  2.  However,  since 
ft2t Image <42).  it  is  not  approximately  linearizable  with  p  =  3.  Thus  it  is  also  not 
locally  linearizable.  Let 

T]=xt-xl 

T2  =  x2  -  xf 

T}  =  x,  +  u  -  (x2  +  2.x,u  +  xfu  +  u2)2 

Then 

y,U  +  1)1  r,(.x(f  +  I))  =  x 2 ( r )  +  2xt(tMt)  +  x,(r)2u(f)  +  ifit)2  -  (x,(r)  +  tfit))2 
=  Xj( I)  -  x,(f)2  +  xt(t)2u(t)  =  r2(.x(f))  +  0(x,  u)3 
=  y2(t)  +  0(x,  u)3 

v,(f  +  1)1  r2(.x|t  +  l))=  T3(x(/),u(f))lt<f) 

4.  Multi-input  case 

The  results  in  §  3  can  be  easily  generalized  to  the  multi-input  case.  Thus  in  this 
section  we  give  (without  proof)  a  sufficient  condition  for  local  linearizability  and  a 
necessary  and  sufficient  condition  for  approximate  linearizability  by  state  feedback 
and  coordinate  change  for  a  multi-input  non-linear  discrete-time  system  (for  proof  see 
Lee  1986). 

Consider  a  multi-input  non-linear  discrete-time  system  of  the  form 

I:  x(f  +  l)  =  /(x(r),u(f))  (4) 

where  x(t)elRv.  uO)eP",  and  f(x,  u):  Rv  *"-»  R*  is  a  C*  Revalued  function.  Also, 
consider  the  following  multi-input  linear  discrete-time  system  E0: 

I0:  >if  +  l)  =  Ayit)  +  Btit)  =  rf\lr).  tiO) 

where  >•( r ) e  P‘\  r(f)e  P".  A  =  block  diag  ,4  22 . Amm) 

'0  10  0  0 
0  0  I  ...  0  0 

4„  =  :  •  ■  (A,  x  K,  matrix) 

0  0  0.01 

0  0  0.0  0 

m 

I  K,  =  N. 

i  i 

B  =  block  diag  [6,,  b2 . 

h,  =  ( 0  ...  0  1)T  (K(  x  I  matrix) 

Definition  4 

I  is  said  to  be  locally  linearizable  at  (xc,  uc)  if  there  exist  indices  ]  K , ' ,  an  open 
neighbourhood  U  {a  Pv  +  m)  of  an  equilibrium  point  (x,,  ut)  and  a  diffcomorphism 
T  V  -» T(V)  such  that 

(i)  T=  (T1?  T2, ....  Tn)  are  functions  of  x,,  x_ . xN  only, 

(ii)  T(xt,  uc)  =  0(Ar  +  m)x  i 
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If  we  let 

then  y(r)  and  r(r)  satisfy  the  relation  £0- 
Definition  5 

£  is  said  to  be  approximately  linearizable  with  order  p  if  there  exist  indices  ,, 
an  open  neighbourhood  l/(cR*“'l  of  an  equilibrium  point  (xc,  ut)  and  a  dif- 
feomorphism  TV-*  T{V)  such  that 

(i)  T=  (T,,  T2 . Tv)  are  functions  of  x,,  x2 . xN  only, 

(ii)  T(x«,  ue)  =  , 

(iii)  T  /  =  *  T+Ofx-X',u-utrt‘ 

Thus  in  Definition  5  we  consider  the  following  nearly  linear  multi-input  discrete¬ 
time  system: 

£o:  >lf  +  1 )  =  Ayft )  +  Br(f)  +  0(x  -  xe,  u  -  *  1 

where  the  N  x  N  matrix  A  and  N  x  m  matrix  B  are  as  in  £0. 

Now  we  state  the  generalized  version  of  Lemmas  1  and  2  and  Theorems  3  and  4. 
Just  as  in  the  single-input  case,  we  can  assume /( 0, 0)  =  0  without  loss  of  generality,  if  / 
has  an  equilibrium  point.  Also,  we  define  /'(x,  u)  in  the  same  way  as  in  the  previous 
section. 

Lemma  5 

£  is  locally  linearizable  at  (0,  0)  if  and  only  if  there  exist  {K,  }J"=  i  and  C*  functions 
h,(x),  /i2(x). . h„(x):  lV(c  R*)-*  IR  such  that 

(i|  W  is  an  open  neighbourhood  of  0ePN, 

(ii)  Dy(hj  /')  s  0  for  1  ^  m  and  1  ^  i  <  kt  —  1 

(iii) 
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V  dtt  /  (0.0 


/d(hm  /*-)\ 

_V  /  (0.0)  _ 

(v)  h/0)  =  0  for  1  <  m 

Let  £  =  (.vT  wT)T.  Thus  £  is  a  (N  +  m)  x  1  vector. 

Lemma  6 

I  is  approximately  linearizable  with  order  p  if  and  only  if  there  exist  {K,}J"= ,  and 
CT  functions  h,(x).  h2(x) . hjx):  Wfe  RN)-»  R  such  that 

(i)  W  is  an  open  neighbourhood  of  Oe  R*. 

(ii)  (D^D^h,  ~f')\0,0)  =  0  for  1  <  j  ^  m,  1  $  i  ^  kj  -  1,  0  $  k  $  p  -  1 

(iii) ,  (iv)  and  (v)  of  Lemma  S  are  satisfied. 


(v\  {*)“"(#.)  (l\ 

\^x)to.Oi\^ui /(O.O)  U/<o.oi  \5u,/(00)  \du2J( o,o) 

(VY1  'l(JL)  (JL)  X 

*A/<0.0>  VU2/<0.0)  \<1u"./(0.0)  VX/(0.0)  \^Um/(0.0)J 

'_v\  m  (VY'~2( JL)  (JL) 

V<1ul/(0.0l  VVlO.OI  VUt/<0.0>  \^XJt0.0)\^Ul/(0.O)  \^u2/(0.0) 

WY<y\  /<y\  (<j\k,  y )  ..  , 


V  X/(0.0I  VU2/I0.0I  V  Um/<0.0|  \^x/(0.0*  V^iii/lO.OlJ 

Suppose  that  the  elements  of  E  are  linearly  independent;  that  is,  they  form  a  basis 

I 

for  R'\  Let  a,  =  £  K>  for  1  <  i  ^  m.  Define  £'{i>) :  R*  -*  R  by  C(v)  =  ot„r  where  v  is  a 

j  -  * 

1  x  .V  row  vector  and 


rT  =  x, 


+  ...  +  a„ 


df\K'-'(df 


\^*/(0.0)  \^W1  7(0.1 


+  «.,♦!  I  57"  I  + 

i  \^U2/<0.0) 


+  M/)V  j  \,-iu  J  |  I  +  ... 

VX/<O.OI  VU2/(0,0)  \<Mm/(0.0) 

/<yy-1/  d/\ 

+  a-UJ  Ur 

V  -X/(0.0I  v  Mm/(0.0) 


Also,  define  £' :  R'”"v->  Rp  for  i  =  1,  2 . m  by 


rm=(nr,)  ...  c*(r.»T 
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where  Vj  is  the  jth  row  of  V.  Let 

fi-fl 


o.o, 


Also,  let  >’i  =  ((/f'i )T  (A)T  ...  (#)T)T.  Let 


Di  = 


D|,  D!2 
Di.  D\2 

D'k,  D'k2 


where 


tyj  ~  Om(l|  -  1  )(\  +  jh)'  x  Ni  * 1 


D\,=  © 


U+D! 


(fl/Ao.o, 

(B5)(o, 


0) 


(Bfj-'Wo, 


D'kk 
if  /  <  j 

if  i  >  j 


Theorem  7 

Z  is  approximately  linearizable  with  order  p  ( ^  2)  if  and  only  if  there  exist  {£,  }•"=  t 
such  that 

(i)  the  elements  of  £  are  linearly  independent, 

(ii)  span  £,  =  span  (£,  n  £)  for  1  ^  i  ^  m 

(iii)  y‘p  - 1  e  Image  (Dj,  _ , )  for  1  ^  m 

Remark 

Z  is  approximately  linearizable  with  order  1  if  and  only  if  (i)  and  (ii)  of  Theorem  7 
hold,  just  as  in  the  continuous-time  case  (Krener  1984).  If  m  =  1  (single-input  case)  then 
-  N.  Thus  (i)  of  Theorem  7  is  the  same  as  (i)  of  Theorem  3.  Since  £,  =  £,  n£, 
(ii)  of  Theorem  7  is  trivially  satisfied.  Since  the  operator  is  the  same  as  the  opera¬ 
tor  £  in  the  previous  section,  ~Pp-,.  Since  =  Ap_,.  (iii)  of  Theorem  7 
is  the  same  as  (ii)  of  Theorem  3.  Therefore  Theorem  7  is  a  generalized  version  of 
Theorem  3. 

Now  a  sufficient  condition  for  local  linearizability  is  given  in  the  following 
theorem. 

Theorem  8 

Suppose  that  /(.v,  u)  of  Z  is  an  analytic  IR^-valued  function.  Z  is  locally  linearizable 
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at  (0, 0)  if 

(i)  the  elements  of  E  are  linearly  independent, 

(ii)  span  E,  =  span  (Ei  nE)  for  1  ^  i  <  m 

(iii)  there  exists  fc(<  x)  such  that  yjespan ((£{)*)  for  1  and  1,  where 

(£')*  is  composed  of  the  first  k  columns  of  D\. 

Given  the  system  (4),  we  choose  the  Kronecker  indices  i  in  a  similar  way  to 
the  continuous-time  case  (Hunt  and  Su  1981).  First  we  form  the  matrix 


Let  a,  be  the  number  of  linearly  independent  vectors  in  the  first  i  +  1  rows  for  0^  i 
^  N  -  1.  Take  y0  =  a0  and  y,  =  a,  -  a,  ,  for  1  ^  i  ^  N  -  1,  and  define  K,  to  be  the 
number  of  yt  with  yf  ^  /. 
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Appendix 

Lemma  A.l 

Equation  (2)  holds. 

Proof 

Clearly, 

D7D„(/i  <’/')  = 

1  =  0 

D?  *  1  DJh  >/‘)  =  ”l  (Bi.  t 1 
1  =  0 

where  {/?!*,}  are  to  be  determined.  Then 

B'm.l0  =  D.(B'm,0)  (Al) 

i.i  =  D.(B'm  l)  +  D.fiT ®  B'm  l  ,  for  1  $  /  ^  m  (A  2) 

(A3) 

By  (A  I),  since  BJ,  0  =  Dy/,T.  B'm  0  =  D?Dyfn  for  m  Ss  1.  Note  that  (2)  is  true  when 
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m  =  1.  Now  suppose  that  (2)  is  true  for  m  ^  p.  Let  1  $  /  p.  Then,  by  (A  2), 

=  'l  £  -  I'oj*1  ‘'(D.r'ODV  ‘•■(D./'T®  ...®DV  '  *■ 


*i  =  l  *1  -  l  *i  -  1 


P  *  I  -  /  *  I  *i  *i 


(D./,t®DV  'DJ'T )...))+  £  Z  ■  Z  D.fn®Dp ** 

*i  =  i  *i=i  *.  1  =  1 

(D,/,t®D‘.1  ‘-(D./"®  ...  ®DV  •’  *'  '(D./"®DV  •  1 0U/‘T) ...» 

Changing  the  dummy  variables  A,.  . Ac,  ,  of  the  second  term  into  A3 . A„ 

respectively,  the  second  term  becomes 

I  I  ...  I  D.f‘l®Dr'  l“(D.fn®D\!  *‘(0./'T®  ...  ® (£>*.'  ■-*' 

*,  =  i  *i=i  *,  =  i  s 

P+1  1*1  *i  *.'  h  i 

(D./,t®OV  1D„/,t)...))=  £  III  1  ,+ 1  '*• 

*.  =  p*  i  i=  i  *,  =  i  *,  =  i  *,  i 

(D./,T®DV  l-(D4/,T®D‘.!  *’(£>./,t®  ...  ® DV  1  ‘'(D./,t®DV  1  D„/lT) ...)) 

Thus  (2)  is  true  for  m  =  p  +  I  and  I  <  /  <  p.  By  (A  3).  it  is  easy  to  see  that  (2)  is  true  for 
m  =  l  =  p  +  1.  Hence  (2)  is  true  for  m  =  p  +  1.  By  induction.  (2)  is  true  for  I  □ 

Let  /i(.x):Rv->R  be  a  C’  function. 


Lemma  A. 2 

If 


(S|  S2  ...  S,| 


<OXo 

(OX-0 


|<o‘,*X=„  J 

where  S,  is  a  p  x  /V‘  * 1  matrix  for  1  s£  t  A.  then  de  Image  I fl).  where 


*-((?*•)•(?*■) . C®,,s*)) 


Proof 

Equation  (A  4)  is  equivalent  to 


Consider 


S,(0^),-o  +  .^(O^L-o+  +S1(Di*X.0  =  </ 


S|U>X-o  =  <T 


Note  that  (C2Ai 4  /i,y  =  h,,  A  [f  'h f  x/x,),  for  1  $  i  $  V  and  I  $/s:,V. 

Let  (v, ),  be  the  ith  column  of  S, .  Then,  since  htj  =  hJ(. 


S,(DfJi)x^0—  £  (vi)(<i,  +  £  £  ((.s i ), 

0|  x  I  fli  =  1  aj  -  ai  ♦  I 

+  (.v, 

)(a^  IlN’  4  at 


/  I  J.V  ♦  a: 
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Therefore  d  espan((3).  where 

Q  =  {  U  U  !Sl«i  I  IV  +  'lu.  I  >V  ♦  u,  1  ) U  iS'"i  IIS-ill) 

\dj  -  1  <l»  -U|  ♦  I  /  \(J|  -  1  / 


Now  consider  the  matrix  S',  defined  by 

S',  =  ©S,  =S|<l'12  +  l/2l> 


It  is  easy  to  see  that  the  ((<i,  -  1  ).V  +  u,)th  column  of  S, 

f  -f'l  l(u,  i  ».v » a ,  if  a,  =a2 


tS  I  1| a  i 


ffsl  l|u,  1 1\  ♦  a:  +  (.V,  )(a,  li\*a,  ifu,  #  tf. 


where  1  $  u,  $  S'  and  1  $  a2  $  S’.  Clearly  Image  (S', )  =  span  (Q|.  Similar  arguments 
can  be  applied  for  (D  '#i)a  _  0 . (Di’ 1  h)x  Therefore  d e  Image  |B|. 
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A  Structure-Independent  Approach  to  the 
Analysis  and  Synthesis  of  Recursive  Linear 
Time- Variant  Digital  Filters 
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Abstract — This  paper  develops  a  systematic  approach  to  select  a  time- 
dependent  state  transf  nmation  which  can  map  a  linear  time- variant  (LTV) 
digital  filter  to  an  equivalent  filter  having  diagonal  state-feedback  matrices. 
Due  to  the  structural  simplicity  of  the  diagonal  systems,  this  time-depen 
dent  state  transformation  is  a  convenient  tool  for  analyzing  recursive  LTV 
filters  expressible  in  the  state- variable  form.  In  this  paper,  we  discuss  both 
the  theoretical  basis  and  the  application  of  this  diagonali/ation  procedure 
The  properties  of  two  types  of  recursive  LTV  filters  are  examined  by  using 
this  state  transformation  technique.  Based  upon  the  separable  properties  of 
the  impulse  responses,  we  have  explored  a  new  algorithm  for  synthesizing 
desired  impulse  responses  with  a  major  class  of  recursive  LTV  filters.  This 
technique,  though  suboptimal,  can  substantially  reduce  the  computation 
required  in  the  synthesis  procedure. 

I.  Introduction 

N  RECENT  YEARS,  there  has  been  considerable  inter¬ 
est  in  the  analysis  and  svnthesis  of  linear  time-variant 
(LEV)  digital  filters  for  processing  signals  whose  character¬ 
istics  change  significantly  with  time.  Generally  speaking,  a 
straightforward  extension  of  a  synthesis  technique  for  lin¬ 
ear  time-invariant  (LTI)  filters  is  sufficient  for  implement¬ 
ing  I ,  I  V  filters  which  have  finite-duration  impulse  re¬ 
sponses.  However,  the  use  of  a  recursive  structure  has  the 
advantages  of  saving  compulation  lime  and  storage  space, 
if  the  duration  of  the  desired  impulse  response  of  an  LTV 
filter  is  relatively  long.  But  the  synthesis  of  a  recursive 
LTV  filter  is  difficult  because  the  characteristics  of  an 
1  TV  filter  are  related  to  the  filter  coefficients  in  a  com¬ 
plicated  fashion,  except  for  certain  filters  implemented 
with  some  special  structures  |1).  Some  researchers  have 
suggested  a  very  simple  but  somewhat  heuristic  synthesis 
method,  which  is  based  on  the  implementation  of  the 
frozen-time  transfer  function  of  an  LTV  filter  [2],  (3). 
However,  as  illustrated  in  [4],  noticeable  differences  be¬ 
tween  the  desired  and  the  realized  filter  characteristics  of 
an  1.7  V  filler  exist,  unless  the  filter  coefficients  change 
scr\  slowly  with  time 

I  he  basic  properties  of  a  continuous-time  LTV  system 
realizable  as  a  differential  equation  have  been  investigated 
m  the  literature  (5)  [7],  Recently,  the  properties  of  the 
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discrete-time  counterpart  have  also  been  reported  [1),  [8). 
But  the  previous  studies  of  discrete-time  recursive  LTV 
filters  are  limited  to  those  filters  realized  with  time-variant 
difference  equations.  Since  there  are  many  other  conceiv¬ 
able  structures  which  can  be  utilized  to  implement  an  LTV 
filter,  a  structure-independent  approach  to  the  analysis 
and  synthesis  of  recursive  LTV  filters  is  desirable.  Unlike 
the  implementation  of  a  recursive  LTI  filter,  the  structure 
selected  for  implementing  a  recursive  LTV  filter  is  an 
important  factor  in  determining  the  characteristics  of  the 
realized  filter. 

The  main  objective  of  this  paper  is  to  develop  a  struc¬ 
ture-independent  approach  to  analyze  and  synthesize  re¬ 
cursive  LTV  filters.  Therefore,  we  express  the  basic  model 
of  recursive  LTV  filters  in  the  state-variable  form,  which  is 
capable  of  representing  most  recursive  LTV  filters.  A 
time-dependent  state  transformation  is  devised  to  reduce 
the  complexity  of  the  state-variable  model  of  LTV  filters. 
We  then  examine  the  basic  properties  of  recursive  LTV 
filters  hv  utilizing  the  time-dependent  state  transforma¬ 
tion.  Further,  we  explore  the  solutions  to  the  synthesis 
problem  of  realizing  an  LTV  impulse  response  with  a 
finite-order  recursive  digital  filter. 

In  Section  II.  we  first  introduce  several  descriptions  of 
LTV  digital  fillers,  and  discuss  the  properties  of  those 
representations  relevant  to  the  synthesis  of  recursive  LTV 
filters.  In  Section  III.  attention  is  devoted  to  the  analysis 
of  LTV  filters  represented  in  the  state-variable  form.  A 
time-dependent  state  transformation  that  diagonalizes  the 
state-feedback  matrices  has  been  developed  so  that  an 
LTV  filter  expressed  in  the  state-variable  form  can  be 
transformed  into  a  filter  consisting  of  K  parallel  first-order 
filters.  This  leads  to  a  general  expression  for  the  impulse 
response  realizable  via  a  recursive  LTV  filter.  In  Section 
IV,  we  illustrate  how  this  diagonalization  procedure  can  he 
applied  to  analyze  the  properties  of  LTV  filters  realized 
with  different  filter  structures.  In  Section  V.  we  formulate 
the  time-domain  synthesis  problem  of  a  recursive  LTV 
filter  by  minimizing  the  squared  difference  between  the 
desired  impulse  response  and  the  impulse  response  realiz¬ 
able  as  a  major  class  of  recursive  LTV  filters.  The  numeri¬ 
cal  difficulty  of  obtaining  the  optimal  solution  is  ex¬ 
amined.  An  efficient  suboptimal  algorithm  based  on  the 
minimization  of  the  localized  squared  difference  between 
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the  desired  impulse  response  and  the  realized  impulse 
response  is  also  developed.  A  numerical  example  has  been 
selected  as  an  illustration  of  this  synthesis  algorithm, 

II.  Characterizations  of  LTV  Filters 

Generally  speaking,  most  methods  for  describing  LTV 
filters  are  evolved  from  those  used  for  LTI  filters.  A 
common  time-domain  description  for  LTV  filters  is  the 
time-variant  impulse  response,  which  is  defined  as  the 
output  measured  at  the  instant  n  in  response  to  a  unit- 
sample  input  applied  at  instant  m.  Then,  the  input  x(n) 
and  the  output  y(n)  are  related  to  the  impulse  response 
h(n.m)  by  the  summation 

X 

v(n)=  £  h(n,  m)x(m)  (1) 

m  *■  -  oc 

where  the  filter  is  said  to  be  causal  if  the  impulse  response 
h(n.m)  satisfies 

h(n.m)  =0,  for  non.  (2) 

If  one  considers  the  computation  and  storage  require¬ 
ments  of  implementing  an  LTV  filter  having  a  long-dura¬ 
tion  impulse  response,  it  is  desirable  to  synthesize  the  filter 
using  some  recursive  filter  structure.  A  widely  used  struc¬ 
ture  for  implementing  LTV  filters  is  a  time-variant  dif¬ 
ference  equation,  which  relates  the  output  sequence  y(n) 
to  the  input  sequence  x(n)  by 

£  a,(n)y(n  -  i)  =  £  b,(n)x(n  - /)  (3) 

<  -  n  <  =  » 

where  a„(n)  ¥=  0  for  all  n  and  the  order  of  the  difference 
equation  is  equal  to  K  if  aK(> i)*0  for  some  n.  By 
adopting  the  direct  form  II  structure  [9]  used  in  the 
synthesis  of  LTI  filters,  one  can  define  another  LTV  filter 
structure  in  terms  of  a  two-stage  difference  equation 

K 

w(n)  =  -  £  a,(n)w(n  - /)  +  x(n) 

i~\ 

y(n)  =  £  c,(n)w(n-i).  (4) 

(  -o 

The  block  diagram  of  the  direct  form  II  realization  of  LTV 
filters  is  shown  in  Fig.  1.  Many  other  filter  structures  can 
be  developed  in  the  same  way.  However,  unlike  the  case  of 
representing  recursive  LTI  filters,  it  is  difficult  to  establish 
the  explicit  relationships  among  the  recursive  LTV  filters 
realized  with  different  structures. 

For  the  purpose  of  analyzing  recursive  LTV  filters 
synthesized  with  a  variety  of  structures,  we  focus  our 
attention  on  the  state-variable  representation  in  the  pres¬ 
ent  study.  Assuming  that  x(n)  and  >>(/>)  denote,  respec- 


u,(w) 
U|(w  -1) 


u2(m) 
u2(w  - 1) 


Fig.  1.  Block  diagram  of  the  direct  form  II  realization  of  a  recursive 
LTV  filler. 


tively,  the  input  and  the  output  of  the  filter,  the  input- 
output  relationship  of  a  recursive  LTV  filter  can  be  ex¬ 
pressed  in  terms  of  the  state  equations 


y(n)-C(n)W(n) 


(5) 


where  W(n)  is  the  state  vector  and  A(n),  B(n)  and  C(n) 
are  matrices  of  appropriate  dimensions.  It  is  clear  that  a 
recursive  LTV  filter  realized  via  a  difference  equation  can 
be  easily  expressed  in  the  state-variable  form  by  choosing 
an  appropriate  set  of  variables  a«  the  state  vector.  Further, 
the  state-variable  form  is  very  suitable  for  representing  a 
recursive  LTV  filter  with  multiple  inputs  and  multiple 
outputs. 

The  basic  properties  of  an  LTV  filter  realizable  as  a 
time-variant  difference  equation  has  been  explored  by 
Huang  and  Aggarwal  [1|.  The  main  result  of  Huang  and 
Aggarwal's  work  relevant  to  our  present  study  is  stated 
below.  The  time-variant  impulse  response  of  a  recursive 
LTV  difference  equation  given  in  (3)  is  a  /fth-order  causal 
separable  sequence  of  the  form 


h(n.m)  =  {  "S*"1  (6) 

elsewhere 


i-i 

10. 


where  u, («),/=  1,2,-  •  K  are  K  independent  solutions  of 


£  a, (n)y(n  -  i)  =  0 
/  -  0 


uK  ,(m) 
uK  ,( w  -  1 ) 


uK(m) 


,(m-A  +  l)  uK(m  ~  K  -t  1 ) 


(7) 


and  vt(m)  is  given  by 

i,  D (m  +  k) 

v,(m)  =  £  bk(m  +  k)— — — —  (8) 

*  _ o  D(m+k) 

where  D(m)  denotes  the  determinant  of 


(9) 


M,(m  -  K  +  1) 
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and  L>  ( m )  is  the  cofactor  of  the  element  ut(m)  of  the 
determinant  Dim).  An  implicit  formulation  of  this  result 
can  also  he  found  in  [8).  However,  after  careful  examina¬ 
tion.  we  find  that  the  expression  in  (6)  holds  true  for  all  n 
and  in  onlv  when  I.  ^  K  -  1  and  tiA(»i)*0  for  all  n 
Ihereforc.  we  suspect  that  there  exists  a  more  generalized 
expression  for  the  impulse  response  of  an  LTV  difference 
equation  In  addition  to  the  time-variant  difference  equa¬ 
tion.  we  are  also  interested  in  anakzing  LTV  filters  real¬ 
ized  with  other  filter  structures  Consequently,  the  devel¬ 
opment  in  this  paper  is  based  upon  the  state-variable 
model  for  i  f A  filters  The  detailed  derivations  associated 
with  the  state-variable  model  are  discussed  in  the  following 
section 


III  Dl  XliONAI  l/IM.  iRVNsKlKMxriOS.  H>R 
Sn  tt-VxRiABi  i  Modi  t 


I  he  technique  of  state  transformation  is  a  very  conveni¬ 
ent  tvtol  for  analyzing  linear  models  represented  in  the 
state-variable  form  In  most  cases,  we  want  to  reduce  the 
state-feedback  matrix  to  a  diagonal  matrix  or  a  diagonal- 
shape  matrix  consisting  of  Jordan  hL>cks.  Methods  for 
selecting  state  transformation  matrices  for  a  continuous- 
tune  av  .tern  have  been  discussed  in  [10],  But  these  proce¬ 
dures  are  not  applicable  to  discrete-time  filters  having 
singular  state-transition  matrices.  Therefore,  we  devote  our 
el  fort  to  develop  a  new  procedure  for  selecting  transforma¬ 
tion  matrices  for  a  general  discrete-time  LTV  filter. 

l  et  s  drst  restrict  our  consideration  to  a  single-input 
single-output  LTV  filter  For  the  k  th-order  filter  given  by 
m).  the  matrices  A(n),  B{tt)  and  C(  n )  are  of  dimension 
k  *  k.  K  x  !.  and  1  x  K.  respectively.  After  the  input 
v ( w )  m  is  substituted  with  a  shifted  unit  impulse 
tit  it  hi  }.  the  impulse  response  of  the  filter  can  he  derived 

as 


C  in 


h  (  n .  m  ) 


n  m  \ 

FI  A  (  n  -  j  } 

!  :  U 


B  (  m  ) ,  n  ^  m  +  1 


C(n)B(n), 

t). 


n  =  in 
n  <  m. 


(10) 


Because  the  impulse  response  given  in  (10)  is  a  com¬ 
plicated  expression  involving  multiple  matrix  multipli¬ 
cations.  it  is  difficult  to  examine  the  properties  of  the 
impulse  response  without  further  simplification. 

One  wav  to  circumvent  this  difficulty  is  to  select  a 
time-dependent  state  transformation  that  can  transform  all 
the  state-feedback  matrices  Ain)  into  diagonal  matrices. 
Assume  the  new  state  vector  Win)  is  related  to  the 
original  state  vector  BTn)  by 

W(n)  =  P(n)W*(n)  (11) 

where  Pin)  is  a  nonsingular  transformation  matrix.  After 
substituting  (11)  into  the  original  state  equation  (5)  and 
rearranging  the  result,  we  can  define  an  equivalent  filter 


characterized  by  the  state  equations 

W(n)  =  A*(n)W(n-  1)  +  B*(n)x(n) 

»■(«)  =  C  *(«)*♦  *(«)  (12) 

where 

A’{n)  =  P  '{n)A(n)P(n  -  1) 
fl*(n)  =  P  '(n)B(n) 

C*in)=C{n)P(n).  (13) 

C'onsequentlv .  the  expression  for  the  impulse  response 
given  in  (10)  can  also  he  rewritten  for  the  equivalent  filter 
as 

h  (  n .  hi ) 

B  *  (  m  ) .  n  ■  in  +  1 

It  =  HI 
II  <  HI 

(14) 

If  all  the  state-feedback  matrices  ,4*'n)  are  of  diagonal 
form,  then  the  original  filter  has  been  decomposed  into  k 
parallel  first-order  filters  In  this  wav.  we  can  represent 
each  recursive  LTV  filter  in  terms  of  an  equivalent  di¬ 
agonalized  filter  Hence  the  diagonalizing  transformation 
is  an  useful  tool  for  the  analysis  of  recursive  LTV  filters  of 
different  structure >.  Next,  we  wjl)  show  that  such  a  di¬ 
agonalizing  transformation  exists  for  every  recursive  L  TV 
filter  represented  in  the  state-variable  form. 

Assuming  that  the  system  matrices  of  an  LTV  filter  are 
defined  for  all  n  such  that  A/0  ^  n  5  .V0.  we  should  he  able 
to  choose  the  transformation  matrices  Pin)  for  A/„  -  1<  n 
<  ,V()  in  order  to  completely  define  the  equivalent  filter 
l  et  us  define  the  forward  state-transition  matrix  *!>(.>!.  m) 
of  the  original  filter  as 

ft  nt  J 

n  Ain  -  j) 

i  ~  ii 

/. 

and  let  S^n)  denote  the  linear  vector  space  consisting  of 
all  the  column  vector  q  such  that 

<J>(  n  +  J.n)q  -  0.  (16) 

We  also  choose  A(  Ma  -  1 )  =  A(  N0  +  1 )  =  0  to  facilitate 
the  illustration  of  this  algorithm.  Then  the  transformation 
matrices  P{  n )  may  be  chosen  in  sequence  by  carry  ing  out 
the  following  steps  from  n  =  M0-  1  to  n  =  N0: 

1)  Let  P,(n)  denote  the  ith  column  of  the  transforma¬ 
tion  matrix  P(n).  Then  choose  P,in)  =  A(n)P,(n  -  1)  if 
A{n)Pi{n  -  1)  *  0.  where  /  =  1,2,  ,  K.  Let  l(n)  denote 

the  set  of  index  /  such  that  Ain)Pt{n  - 1)  *■  0  and  let 
ttin)  denote  the  linear  expansion  of  the  column  vectors 
P,(n)  for  i  e  l{n).  The  rest  of  the  column  vectors  in  Pin) 
are  to  be  determined  in  the  next  step. 


n  >  m 


(15) 


f  *<  n  I 


FI  A'in  -  j) 


Cmt  n  >«*<  n) 
0. 
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2)  For  j  =  1,2,-  •  -,  N0  —  n  +  1,  define 
S/(n )  =  the  linear  space  generated  by  the  vectors 

in  S/(n)u  Q(n)  (17) 

and 

Tl(n)  =  SJ(n)±S? 

=  {  q\q  £S((n)  and  q1  p  =  0 

for  every  p  e  S(r  ,( n )} .  (18) 

Let  {,?*,(« )|1  <  k  ^  /C((n)}  be  a  basis  of  the  subspace 
Tt{n)  and  k =  1,2.- •  •,  be  an  arbitrary 

vector  set  in  S  ,( n ).  For  j  =  1, 2,  •  •  • ,  ,V„  -  n  +  1 ,  we  choose 

,( 'i) +  tJ* /('! )•  1  ^  k  ^  Kf{n)  as  column  sectors  of  the 
transformation  matrix  P(/i)  if  Tpn)  *  (0).  The  column 
vectors  selected  in  this  step  will  exactly  fill  the  empty 
columns  of  P(n)  which  have  not  been  determined  in 
step  1 

The  basic  properties  of  the  state  transformation  defined 
by  the  procedure  stated  above  can  be  summarized  in  the 
following  two  theorems.  The  proofs  are  given  in  the  Ap¬ 
pendix. 

Theorem  I :  Assuming  that  a  discrete  LTV  filter  repre¬ 
sented  in  the  state-variable  form  of  (5)  is  defined  in  the 
interval  A/„  $  n  Nn.  then  the  state  transformation  matrix 
/*(  n )  selected  with  the  iterative  procedure  stated  above  has 
the  following  properties: 

1 )  The  matrix  P{  n )  is  of  full  rank. 

2)  Fach  column  of  P{n)  belongs  to  one  of  the  dis¬ 
jointed  sets 

A.V,(n)  =$,(»)- S,  ,(») 

=  {q\qe  S,(«)  and  q&S,  ,(«)}  (19) 

where  /  =  1 . 2,  •  •  ■ .  -  n  +  1 .  The  number  of  the  column 

vectors  of  P(n)  belonging  to  A Sf  is  equal  to  the  difference 
between  the  dimensions  of  S;(n)  and  S,  ,(«). 

Theorem  2.  Under  the  same  assumptions  of  Theorem  1. 
there  exists  an  equivalent  filter  given  in  (12)  such  that  the 
new  state-feedback  matrix  is  of  the  form 


/!*(«)  =  diag[a,(n  )  a2(/t) 


«*(")] 


where  <»,(«)  =  0  or  1  for  /  =  1.2.-  .  A'. 

The  essence  of  these  two  theorems  is  that  once  an  LTV 
filter  of  the  state  variable  form  is  completely  specified  in 
an  interval,  there  exists  a  time-dependent  state  transforma¬ 
tion  which  can  define  an  equivalent  filter  having  diagonal 
state-feedback  matrices  in  the  same  interval.  However,  the 
state  transformation  thus  determined  is  not  unique  be¬ 
cause  in  step  2  of  the  selection  procedure,  we  have  some 


f;ig  2  Block  diagram  of  the  equivalent  filter  having  a  diagonal  state* 
feedback  matrix 


freedom  in  choosing  the  column  vectors  and  assigning 
them  to  respective  column  positions.  Even  though  selecting 
different  sets  of  transformation  matrices  may  generate 
different  diagonalized  filters,  the  input-output  characteris¬ 
tics  of  these  filters  remain  the  same. 

A  general  expression  for  the  impulse  response  can  be 
derived  from  the  result  in  Theorem  2.  Since  the  state- 
feedback  matrix  of  the  equivalent  filter  has  the  form  of 
(20).  the  corresponding  forward  state-transition  matrix  of 
the  equivalent  filter  is  given  as 

$*(/!.  m)  =  diag[/3,(n.  m)  02(n.m)  ■■■  PK(n.m )] 

(21) 

where 


In  m  1 

PI  a,(n  -  j ).  it  >  m  +  1 

(22) 

1 .  n  =  m 

0.  n  <  m 

for  <  =  I.2.-  •  •.  K.  After  substituting  (21)  into  (14).  we  can 
derive  the  impulse  response  of  the  LTV  filter  as 

h(n,m)  =  C*(n)Q>*(n,m)B*(m) 

=  L  t'*(n)0l(n%m)h*(m)  (23) 

I  *=  I 

where  C*(n)  =  C(n  )P(n).  B*(m)=P  \m)B(m)  and 
«*(«)  and  h*(m )  denote  ah  elements  of  C*(«)  and 
B*(m).  respectively.  This  result  can  also  be  easily  verified 
by  examining  the  block  diagram  of  the  equivalent  LTV 
filter  shown  in  Fig.  2. 

The  expression  for  the  time-variant  impulse  response 
shown  in  (23)  somewhat  resembles  the  result  given  in  (6). 
which  is  obtained  specifically  for  time-variant  difference 
equations.  However,  an  additional  weighting  factor 
pjn.m),  which  is  a  function  of  m  and  n.  has  been 
included  in  the  summation  of  (23).  Further,  for  some  LTV 
fillers,  certain  terms  in  (23)  might  cancel.  This  leads  to  the 
situation  that  fewer  terms  in  (23)  really  contribute  to  the 
impulse  response.  This  property  will  be  illustrated  later  by 
applying  the  diagonalization  procedure  to  an  LTV  filter 
realized  as  a  time-variant  difference  equation. 
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IV.  Analysis  of  LTV  Filters  via 
Diagonalizing  Transformation 

Now,  the  usefulness  of  this  diagonalizing  transformation 
is  illustrated  by  applying  it  to  several  realization  schemes 
for  recursive  LTV  filters.  The  analysis  of  LTV  filters  using 
the  diagonalizing  transformation  is  usually  accomplished 
by  the  following  steps.  First,  we  represent  an  LTV  filter  in 
the  state-variable  form.  Then  we  can  obtain  a  set  of 
transformation  matrices  by  using  the  diagonalizing  proce¬ 
dure  discussed  in  Section  III.  The  properties  of  the  original 
filter  can  be  extracted  from  the  system  matrices  of  the 
corresponding  diagonalized  filter. 

The  first  example  is  an  LTV  filter  whose  state-feedback 
matrix  A (n)  is  nonsingular  for  all  n  such  that  A/0  <  n  ^  V0. 
Following  the  selection  procedure  discussed  in  Section  III, 
we  can  choose  the  transformation  matrix  P(  M0  - 1 )  as 
any  nonsingular  K  X  K  matrix  P0.  Then,  just  by  carrying 
out  step  1  of  the  selection  procedure,  we  can  select  the  rest 
of  the  transformation  matrices  as 

A 

After  substituting  (24)  into  (13),  we  can  derive  the  system 
matrices  of  the  equivalent  filter  as 

A9(n)  =  Pl(n)A(n)P(n -1)  =  / 

Bm(n)  =  p-'(n)B(n) 

C*(n)=C(n)P(n)  (25) 

where  M0^  n  <  N0.  The  impulse  response  of  such  a  filter 
can  be  obtained  from  (23)  as 


(n-j))P0,  for  M0an^N0.  (24) 


I  "  *0 

/><«)=  n 

l  y-o 


where 


^n(«)  = 


^nin)  = 


0  1  0 
1 


-«*(») 

bL(n) 


-a2{n)  -a,(n) 


0 


b2(n)  f>,(n) 


^2,(«)  =  [01 
0  1 


-4  22  (  «  )  = 


0 


1 


0 


By  using  the  diagonalization  procedure  discussed  earlier, 
the  transformation  matrix  P(n)  may  be  chosen  as 


P{n)  = 


pnin) 

pnin) 

0 

h(n  ~  M0) 

(29) 


where 


n  - 


J,  n  =  M0  —  1 

o  i  0 


lo, 

A  typical  example  of  such  an  LTV  filter  is  the  direct  form 
II  realization  of  the  recursive  LTV  filter  given  in  (4). 

The  second  example  is  the  time-variant  difference  equa¬ 
tion  shown  in  (3),  where  ag(n)  is  assumed  to  be  1.  In  this 
case,  the  state  vector  may  be  chosen  as 

W(n)  =  [y(n  -  K  +  1)  y(n-l)  y(n)  | 

x(n-L  +  \)  •••  x(n-l)  x(w)]r  (27) 

Note  that  the  dimension  of  the  state  vector  is  K  +  L. 
Then,  the  corresponding  system  matrices  of  the  filter  are 
found  to  be 

«(/t)  =  [0  •••  0  b0(n)  |  0  0  0  l]r 

C(n)  =  [0  0  •••  1  |  0  0  0] 

and 


^ii  (") 

^u(") 

^2l(«) 

*  n(n) 

n  >  m 


(26) 


n  <  m. 


hi  0)  = 


0  0 

Li  o 

MIN 


pnin)  =  -  Pu(n)  £ 


i-i 


L  Pn("  +  +  j) 


j- 1 


•Aft, ((»  +  /-  A/0))  +  l) 


+  <?(«) 


MIN 


I-  L  /0 (n  +  i-  M0) 


I  -  1 


<?(")“ 


Pl\in)  I-  1 1  (  0  ^  1 2  (  0  ^5  (  ^  Mo  1)< 


[0, 


N0>  n  >  M0 

n  -  M0  -  1 


MIN  =  minimum  (  L,  N0  -  n } 

loU)  =  isl(J)*[K'K]hU) 


where  A[>.  j)  is  an  Lx  L  matrix  having  a  single  nonzero 
element  1  at  the  position  (i,  j)  and  ((/))  denotes  the 
L-modulus  of  i. 

After  substituting  (29)  into  (13),  we  obtain  the  corre¬ 
sponding  system  matrices  of  the  equivalent  diagonalized 
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filter  as 


/<*(«)  =  P  '(n)A(n)P{  n  —  1 ) 


/ 

0 

0 

/  -  f„(  n  -  A#0 )  _ 

p  1 

(n)B(n) 

C*(n)  =C(n)P(n).  (30) 


Then  the  impulse  response  of  this  filter  can  be  obtained  by 
substituting  (30)  into  (23).  After  some  simplification  of  the 
resulting  expression,  the  impulse  response  is  of  the  form 


h(rt.m) 


mm {  /  .  n  m } 

I  [0  0  1  ]/*,,(  M  )#*!, 1  ( M  +  y ) 

/  -  0 

bj  ( m  +  j  )[0  •  •  •  0  1  ] ' ,  /i  ^  m 

0.  n  <  m 


(31) 


which  is  somewhat  different  from  the  expression  given  in 
(6).  But.  with  further  simplification,  it  can  be  shown  that 
(31 )  is  equivalent  to  (6)  for  L  <  K  -  1.  From  the  definition 
of  /»,(«)  given  in  (29),  it  is  clear  that  the  upper  A-l 
rows  of  /*,,(«)  are  obtained  by  shifting  each  of  the  corre¬ 
sponding  rows  of  Pu(n  -1)  upward  by  one  row.  This 
leads  to  the  relation 


(0  0-  -  1]/'u(m)P11i(mi  +  j  )[0  0  1]'  =0  (32) 

for  n  -  m  +  1  ^  j  $  ;i  -  m  +  K  -  1.  After  substituting  (32) 
for  n  -  m  +  1  ^  j  ^  L  into  (31).  we  have 


h  (  n .  ni  ) 


|  L  M"'  +  7  ) [0  ■  •  •  \  \P ,,(«)/* 1 1 1  ( mi  +  j ) 

I  /  ~  0 

[0  ■  •  •  1  ] ' .  n  >  ill 
0.  it  <  ni 


(33) 


according  to  some  error  criterion.  In  a  practical  problem, 
the  desired  impulse  response  may  be  determined  either  by 
solving  a  statistical  filtering  problem  or  by  using  some 
empirical  rules.  Since  the  computation  for  obtaining  the 
true  optimal  solution  grows  rapidly  as  the  order  of  the 
filter  or  the  duration  of  the  impulse  response  increases,  it 
is  necessary  to  develop  efficient  solutions  for  the  synthesis 
of  recursive  LTV  filters.  In  this  section,  we  introduce  an 
efficient  suboptimal  technique  which  is  based  upon  the 
minimization  of  the  squared  difference  between  the  desired 
impulse  response  and  the  synthesized  impulse  response  in 
localized  regions.  A  numerical  example  is  selected  to  il¬ 
lustrate  this  new  synthesis  technique. 

It  is  quite  difficult  to  synthesize  a  desired  impulse  re¬ 
sponse  in  terms  of  impulse  responses  given  in  (23)  that 
have  arbitrary  /?,(«,  m)’s.  In  order  to  obtain  a  manageable 
formulation  of  the  synthesis  problem,  we  restrict  our  con¬ 
sideration  to  a  subset  of  recursive  LTV  filters  whose 
impulse  responses  are  causal  ACth-order  separable  func¬ 
tions.  i.e.. 

/i(n.  m )  = 

'0.  elsewhere 


£  u,(  n )  r,(  in ).  n  >  ni 


n/ii 


where  {  m,(ji);  /  -  L-  •  ■.  K  }  and  (  r,(m);  /  =  1.-  ■  ■.  K  }  are 
two  sets  of  independent  functions.  Even  though  (34)  is  a 
special  case  of  the  expression  given  in  (23).  (34)  is  still 
capable  of  representing  a  major  class  of  the  recursive  LTV 
filters.  In  particular,  the  impulse  responses  of  most  recur¬ 
sive  LTV'  fillers  discussed  in  Section  IV  satisfy  (34).  As¬ 
suming  that  the  desired  causal  impulse  response  is 
h  „ (n.  m).  the  synthesis  of  the  recursive  LTV  filter  can 
then  be  formulated  as  the  minimization  of  the  squared 
error  function 


/)(<->) 


I  I 

n  -  O  in  -  0 


h„(n.  ni )  - 


A 

Y,  m,(«  )i,(m ) 

i  - 1 


(35) 


This  expression  is  equivalent  to  the  result  given  in  (6). 

From  the  above  discussion,  we  have  shown  that  the 
diagonalizing  transformation  is  a  valuable  tool  for  the 
analysis  of  digital  LTV  filters.  Therefore,  a  systematic 
approach  based  upon  the  diagonalizing  transformation  can 
be  developed  to  reduce  the  complexity  of  the  LTV  filters 
synthesized  with  a  variety  of  structures.  Since  the  original 
filter  is  state-to-state  equivalent  to  the  corresponding  di¬ 
agonalized  filter,  the  use  of  the  diagonalizing  transforma¬ 
tion  is  promising  in  such  areas  as  the  stability  analysis  and 
the  roundoff  noise  analysis  of  LTV  filters. 

V.  Synthesis  of  Rk  ursivf  LTV  Fii  tfrs 

In  this  section,  we  examine  the  time-domain  solutions  to 
the  deterministic  synthesis  problem  of  recursive  LTV  filters. 
Our  main  objective  is  to  develop  techniques  for  synthesiz¬ 
ing  a  desired  time-variant  impulse  response  with  a  recur¬ 
sive  LTV  filter  such  that  the  difference  between  the  desired 
and  the  synthesized  impulse  responses  is  minimized 


where  (-)  =  {  w,(  n ).  /*,(«  )|1  ^  i  <  K.  0  ^  n  ^  .V  )  is  a  set  of 
unknown  variables  and  K  is  the  order  of  the  recursive 
LTV  filter.  Once  we  determine  the  optimal  ©  which  mini¬ 
mizes  the  cost  function,  the  coefficients  of  the  recursive 
LTV  filter  synthesized  with  a  particular  filter  structure  can 
be  derived  from  the  optimal  ©.  An  unrestricted  nonlinear 
optimization  algorithm  may  be  applied  to  find  the  solution 
of  this  optimization  problem.  The  coefficients  u,(n)' s  and 
s  are  considered  as  the  unknown  variables  in  the 
optimization  process.  However,  since  the  number  of  un¬ 
known  parameters  is  proportional  to  the  order  of  the  filter 
times  the  duration  of  the  impulse  response,  it  is  impracti¬ 
cal  to  find  the  true  optimal  solution  of  (35)  for  an  LTV 
filter  with  a  large  K  or  a  large  N.  Therefore,  it  is  useful  to 
develop  some  efficient  suboptimal  techniques  for  solving 
this  nonlinear  minimization  problem. 

In  Huang  and  Aggarwal's  work  (11),  the  causal  condi¬ 
tion  in  (35)  is  removed  so  that  a  straightforward  algorithm 
can  be  applied  to  solve  this  filter  synthesis  problem.  The 
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new  error  function  is  defined  as 


D'(&)=  £  £  hD(n,  m)-  £  u,(n)r,(m)  .  (36) 

n  *  0  »i  “  0  .  i  —  1 

The  minimization  of  (36)  can  easily  be  solved  by  using  a 
procedure  that  was  originally  developed  for  finding  the 
spectral  decomposition  of  a  matrix  [12],  The  required 
computation  is  approximately  equivalent  to  that  of  obtain¬ 
ing  A  most  dominant  eigenvalues  and  eigenvectors  of  an 
( N  +  1 )  x  ( N  +  1 )  matrix. 

The  main  drawback  of  using  the  noncausal  error  func¬ 
tion  is  that  the  performance  of  this  synthesis  procedure 
depends  largely  on  how  we  choose  the  desired  impulse 
response  in  the  noncausal  region.  In  order  to  make  the 
result  obtained  by  the  spectral  decomposition  technique 
close  to  the  optimal  one.  the  function  hn(n,m)  in  the 
region  {(/i,  m)\  0  <  n  <  m  <  N }  must  be  selected  such  that 
v  v 

£  I  [hp(n.m)-  h OPT  ( n .  m )] 2  =  0  (37) 

«  a  I)  m  =  «  »  I 

where  h  OPT  (n.  in)  is  the  noncausal  separable  impulse  re¬ 
sponse  defined  by  the  optimal  0  that  minimizes  (35). 
Since  there  are  no  explicit  rules  for  making  a  good  guess  of 
the  noncausal  part  of  the  desired  impulse  response,  the 
spectral  decomposition  technique  often  achieves  less  than 
satisfactory  result.  It  usually  takes  a  high-order  recursive 
LTV  filter  to  make  a  good  approximation  of  the  desired 
impulse  response. 

To  circumvent  the  problems  in  the  nonlinear  optimiza¬ 
tion  technique  and  the  spectral  decomposition  technique,  a 
new  suboptimal  technique  for  solving  the  synthesis  prob¬ 
lem  of  recursive  LTV  filters  is  formulated  b\  minimizing 
(35)  in  a  localized  sense.  The  solution  of  11,(11)  for  /  = 
1.  ■  .A  is  obtained  bv  minimizing  the  localized  error 
function 


D,  ( n ) 


h  „(  n.m)  £  ii, t  ii  )i ,(  hi  ) 


under  the  conditions  that 

A 

hn(n.  m)  =  £  u,(n)r,(m). 

,  =  i 

for  m  =  n .  n  1 .  •  • .  n  -  A  +  1 .  ( 39 ) 

With  an  index  change  in  (39).  we  can  express  the  linear 
equations  for  obtaining  r  (w).  /  =  1.  .  A  as 


h  n{  m  +  j.  m )  =  £  u,(  m  +  /  )r,(  m). 


for  /  =0.1.  .  A  1.  (40) 

After  substituting  (39)  into  (3K).  we  have  that 

«  a  x  ]: 

D,(n)  =  £  h  n.m)-  £u,(«)r,(w)  (41) 

rtl  -  II  .  I  -*  l 

Differentiating  (41)  with  respect  to  u,(n)  and  setting  the 
result  equal  to  zero,  we  can  find  a  set  of  linear  equations 


Fig.  3  Regions  of  the  impulse  response  where  the  error  function 
evaluated  in  different  steps  of  the  localized  technique 


for  M,(M)’ S 


=  Z  hD(n.m)i\(m),  ;=1.  --.A.  (42) 

m  -  0 

Note  that  the  coefficients  uk(n)‘ s  in  (42)  can  be  easily 
calculated  once  we  determine  the  values  of  v,(m)  for 
i  =  1 .  •  .  A  and  m  =  0. 1 .  ••  • ,  n  -  A.  Therefore,  by  using 
(40)  and  (42)  iteratively,  we  can  determine  a  suboptimal 
set  of  «,(«)' s  and  t',(n)'s. 

The  constraints  given  in  (39)  ensure  that  the  synthesized 
impulse  response  is  the  same  as  the  desired  one  in  the 
region  0jj«-w<A-1.  Hence,  the  localized  synthesis 
technique  will  favor  the  synthesis  of  a  impulse  response 
having  dominant  components  along  the  diagonal  line  n  = 
in  Now  let  us  summarize  the  complete  algorithm  of  this 
suboptimal  solution  as  follows: 

1 )  Apple  the  nonlinear  optimization  technique  to  find 
the  optimal  solution  of  (35)  in  a  small  interval  [0.  .V, ).  For 
example,  the  Fletcher  Powell  technique  [13)  works  well  for 
a  small  <V, 

2)  Use  (42)  to  find  u,(n )  for  n  =  V,  +  1 . •  ■  • .  .V,  +  A  and 

i  =  1.  .  A 

3)  For  /  =  1.2. •  •  •.  S  -  Nt.  use  (40)  and  (42)  to  obtain 
the  solutions  for  i',(A,  +  j )  and  u,(  Af,  +  A  +  j).  i  = 
1.  •  .A. 

In  fact,  this  localized  minimization  technique  has  divided 
the  domain  of  the  impulse  response  into  four  distinct 
regions  as  shown  in  Fig.  3.  Regions  1  and  II  denote, 
respectively,  the  areas  where  steps  1  and  2  of  the  algorithm 
are  applied.  And  regions  III  and  IV  represent,  respectively, 
the  areas  where  (40)  and  (42)  are  applied  at  step  3  of  the 
algorithm.  The  computation  of  this  localized  minimization 
algorithm  at  each  sampling  point  is  equivalent  to  that  of 
solving  two  K  th-order  linear  equations. 

We  have  tested  both  the  spectral  decomposition  tech¬ 
nique  and  the  localized  minimization  technique  with  a 
variety  of  time-variant  impulse  responses.  The  results  show 
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tABLF  1 

PtlUORMANCt  iNDK  tSOt  TH1  SVNIHISIS  AtCOKllHMS  IOR  l  TV 

F ii  iirs 

- 1 

|  i  k 

D(K) 

i 

y  j 

SPECTRAL  DECOMPOSITION 

LOCALIZED  MINIMIZATION 

!  1 

0.74114 

0.17202 

0.1  !  2 

0.58250 

0.01805 

:  3 

0.4 6670 

0.00552 

;  4 

0,36559 

0.00275 

■ 

i 

j  i 

0,88489 

0.19549 

1  2 

0.77537 

0.05869 

0.05  :  3 

0.66461 

0.00403 

1  4 

i 

i 

0.60011 

0.00074 

that  the  localized  minimization  technique  consistently  per¬ 
forms  better  than  the  spectral  decomposition  technique  A 
numerical  example  is  selected  to  illustrate  this  situation 
The  impulse  response  of  the  desired  filter  is  chosen  as 

h n(  n.  m  )  =  h(  (  n/16,  m/16)  (43) 

where 

j  exp(  -  [o.02(  t  -  t  )‘  +  0.1/] 

sine  [2(  /  -  t)(1  —  yt )] ,  f^r 

'  0.  elsewhere 

(44) 

and  sincf.x )  =  sin(rrx)/(irx).  The  domain  of  the  desired 
impulse  response  under  consideration  is  limited  to  the 
region  where  0<n;gl28  and  0<m<128.  Basically,  the 
desired  impulse  response  is  a  truncated  sine  function  along 
the  axis  n  +  m  =  0,  but  with  time-variant  frequency  con¬ 
tents  and  an  accelerated  decay.  A  normalized  squared 
error  function 


D(  K  )  = 


£  £  (h(n,m)~  h p(n.  m))‘ 


12K 


L  L  hD(n-m) 


n  •  0  m  ■»  0 


(45) 


is  selected  as  the  measure  of  the  performance  of  the 
synthesis  algorithm  A  lower  value  of  the  normalized  error 
function  means  a  better  approximation  of  the  desired 
impulse  response.  Two  sets  of  the  normalized  errors  have 
been  obtained:  the  first  set  is  for  the  spectral  decomposi¬ 
tion  technique  in  which  the  noncausal  half  of  the  impulse 
response  is  assumed  to  be  symmetneal  to  the  causal  half: 
and  the  second  set  is  for  the  localized  minimization  tech 
nique.  Table  I  lists  the  norm?lized  error  indices  obtained 
for  the  desired  impulse  response  functions  with  y  values 
0.1  and  0.05.  The  desired  impulse  response  with  y  *  0  05, 
the  impulse  response  of  a  fourth-order  filter  synthesized 
with  the  localized  minimization  technique,  and  the  dif¬ 
ference  between  the  two  are  shown  in  Fig  4 

It  is  clearly  seen  from  Table  1  that  the  localized  synthe¬ 
sis  technique  achieves  much  better  results  than  the  spectral 


Fig  4<a)  Desired  impulse  response  for  >  -  u  sn  the  numerical  exam 
pie  (b)  Impulse  response  synthesized  with  a  foirth  order  l  T\  filter 
using,  the  localized  minimization  technique  («.  »  Difference  between  the 
desired  impulse  response  and  the  ssnthesi/ed  impulse  response 
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decomposition  technique  does  We  also  attempt  to  find  the 
optimal  solution  of  minimizing  1 15)  with  a  general  nonlin¬ 
ear  minimi/aiion  algorithm  so  that  we  can  evaluate  the 
results  obtained  bv  those  suboptimal  techniques  After 
experimenting  with  a  numbei  of  initial  conditions,  we  find 
that  the  performance  index  settles  at  a  much  larger  value 
than  that  obtained  bv  the  localized  technique  I  his  seems 
to  confirm  the  conjecture  that  it  is  quite  difficult  to  use  a 
general  nonlinear  minimization  technique  lor  finding  the 
optimal  solution  of  a  svnthesis  problem  which  has  mod¬ 
erate  values  of  filter  order  A  and  impulse  response  dura¬ 
tion  \ 

VI  Ci»m  l  l  slow 

It  has  been  demonstrated  that  the  diagonalizing  trans¬ 
formation  is  a  helpful  tool  for  the  analvsis  and  svnthesis  ol 
recursive  LTV'  filters  because  there  is  a  state-to-state 
equivalence  between  the  original  filter  and  the  correspond¬ 
ing  diagonalized  filler,  the  diagonalizing  transformation 
can  be  useful  in  other  research  areas  related  to  the  recur¬ 
sive  I.  IV  systems.  In  particular,  the  observability  and  the 
controllability  properties  of  a  recursive  t.T\  filter  are 
easily  obtainable  from  the  system  matrices  of  the  equiv¬ 
alent  diagonalized  filler  Since  the  diagonalizing  transfor¬ 
mation  procedure  introduced  in  this  paper  works  only  for 
l.TV  filters  defined  in  a  finite  interval,  further  investiga¬ 
tion  is  warranted  on  whether  we  can  generalize  the 
diagonalizing  procedure  such  that  it  is  applicable  to  filters 
defined  in  an  infinite  interval 


A  I’ 1*1  M>l\ 

Print!  /or  / hctit'i'tv  / 

1 1  o  easily  seen  that  S  t  n  ).  S'lnt  and  /  Hi)  are  linear 
subspaces  of  for  /  n.  I  .  V, ,  u  ■  1.  and  that  (()} 
S.i  n  I  -  S\  ,..(«)  K  *  Then. 

;  A  A  f  ft ).  /  I).  1.  .  V,  n  J  becomes  a  collection  of  dis- 

lointed  sets  f  urther,  if  g  n ).  • .  g,,(  n  >|  is  a 

basis  of  1  (  n  )  and  anv  set  )  rj,,(  rr ).  rj, ,( /t ).  •  ■  ■ .  -qj  n  ) }  is 
contained  in  S  , ( rr  >.  then  {  g  ,(  n  )  «-  rj  n  ).  g  .(/»)  + 
rj  .(/M,  .  g  ,t't)+  r)  ;>  ( ti )  |  is  a  collection  of  independent 

column  vectors  in  the  set  A  V  <  n  >  And  the  number  of 
vectors  m  this  collection  is  equal  to  the  dimension  of  the 
Mibspace  Tin),  which  is  equivalent  to  the  difference 
between  the  dimension  of  .S’,  ( n )  and  the  dimension  of 
the  intersection  of  S'  ,(«»  and  Sihi  furthermore  n  is 
dear  that  the  vector  set  |  g  I  n  I  *  q  t  n  ).  g  (  hi  • 
rj  In).  .  g  (1  (  n  )  +  ij  4  (  n  )  |  together  with  any  set  of  inde¬ 
pendent  vectors  in  s}  ,(11)  constitute  a  new  independent 
vector  set 

Then.  Theorem  1  can  he  shown  bv  induction  \i  n 
\/(,  1.  we  have  that  Al  U  ,  ll  0  which  leads  to  that 
<2(  W„  -  1  )={()}  and  that  S 't  V/  |  ,  S  \/  I  ,  Sm.  e 

tt  V/,,  1  )q  -  0  f0|-  a||  q  a||  the  column  ve.tois  Pi  \f 

I)  are  selected  in  step  2  of  the  diagonal  1  .iiMoim.iti. >n 
procedure  From  the  discussion  m  the  prev<  u-  paragraph 
we  know  that  each  column  of  Pi  \l  lib...  nz  to  one  oi 
the  disjointed  sets  A.Strii.  /  I  \  •  n  I  these 


6  VS 

columns  are  independent.  Because  the  dimension  of  the  set 
/  (  .V/,, -1)  is  the  difference  between  the  dimensions  of 
.V  (  M, ,  1 )  and  Si  ,(  -  1),  it  is  clear  that  the  number  of 

column  vectors  of  Pi  1)  belonging  to  AS,!  ,V/„  1 )  is 

equal  to  the  difference  between  the  dimensions  of  the 
suhspaces  .V  (  Mu  -  1 )  and  V(  ,(  A/,,  1 ) 

Assume  the  statements  in  Theorem  1  are  true  for  n  =  S . 
we  need  to  show  that  they  are  also  true  for  n  A  +  1  In 
step  1  of  the  selection  procedure,  we  choose  /’ (  \  +  1)  - 
Al  V  *1 )/’(  V  )  if  Al  V  t  ]  )P  i  \  )  r  0  Assume  dial  the 
dimension  of  the  subspace  V,(  A  )  is  equal  to  A',  Since 
S,(  A  )  {()}  is  equivalent  to  the  set  A  A,(  V  ),  there  are  A, 

column  vectors  of  Pi  A  )  belonging  to  \,(  V  )  Therefore, 
there  are  A  K.  column  vectors  of  Pi  A  +  1)  selected  in 
the  step  I  And  these  A  A  ,  vectors  must  he  independent 
because  the  matrix  Al  A  +  1)  has  rank  A  A  ,  and  matrix 
Pi  A  )  has  rank  A  Further,  each  of  these  A’  -  A’,  vectors 
selected  in  step  1  satisfy  one  of  the  conditions 

<tM  \  ‘  /  ♦  1  .  A  *  1  )i/  0  and  <t> (  A  s-  /.  A  s  1  )q  z  I) 

(46) 

for  /  -  1.2.  .  V,  -  A  Therefore,  each  of  the  vectors 

selected  in  step  1  belongs  to  one  of  the  sets  A.V  (  V  +  1 ), 
/  1.2.  .  .V,  V  and  the  number  of  vectors  belonging  to 

each  set  is  equal  to  the  difference  between  the  dimension 
of  the  intersection  of  the  sets  S'  ,(  A  +  1 )  and  ,V;(  V  +  1 ) 
and  the  dimension  of  the  set  V,  ,1  A  +  I )  In  the  step  2  of 
the  selection  procedure,  each  column  vector  selected  also 
belongs  to  one  of  the  sets  A  .S',  (  A  +  I)  for  /  =  1.2.  ,V„  - 

V.  From  the  discussion  at  the  beginning  of  this  pi  oof.  the 
number  of  column  vectors  belonging  to  A.V,(  S  +  1 )  is  the 
difference  between  and  the  dimension  of  S  (  V  +  1 )  and  the 
dimension  of  the  intersection  of  Sf  ,(  A  -r  1 )  and  .V  (  V  +  1 ). 

After  reviewing  the  properties  of  the  column  vectors  in 
both  steps  of  the  selection  process,  we  have  that  the 
number  of  column  vectors  belonging  to  A  .S',  ( .V  +  1)  IS 
equal  to  the  difference  between  the  dimensions  of  .V  ( .V  +  1 ) 
and  S/  ,(  S  +  1)  for  j  =  1.2. •  •  -.  A,,  -  A.  Further,  the  new 
vectors  selected  at  each  stage  of  step  2.  together  with  the 
column  vectors  of  PI  \  r  1 )  previously  chosen,  still  con¬ 
stitute  a  set  of  independent  vectors  Thus  the  matrix 
Pi  A  +  1 )  is  a  nonsingular  matrix  Now  we  have  shown  that 
the  statements  in  Theorem  1  holds  true  for  the  matrix 
Pi  A  +  1 )  Bv  induction,  the  statement  in  Theorem  1  is  true 
for  all  n  such  that  A/,  I  <  n  f,  V,,. 

Pnmt  fur  I hrnn  ni  2 

By  using  the  revult  obtained  in  theorem  I  we  van  find 
the  transformation  matrices  Pin)  for  n  -  \/, ,  1. 

\f...  .  A.  .  and  these  matrices  have  the  properties  stated 

m  theorem  1  Assume  that  matrix  Tin)  has  rank  A  „  lor 
v/  11  •  N„.  we  have  that  there  are  A  ,  independent  col 
unm  vectors  Pin  lisuvhth.it 

Pin)  .41  11  I/’  I  11  1)  *  b  1 4'  1 

and  the  rest  ol  the  volumns  satislv  that 

(4S  1 
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From  these  two  equations  we  have  that  P  >{n)A(n)P(n 
1 )  =-  A  [  / )  if  A(  n  )P{  n  -  1 )  *  U  and  P  i{n)A{n  )P(  n  - 
1 )  =  0  if  A(  n  )P  (  n  -  1 )  =  U.  where  A[ / 1  denotes  a  column 
vector  having  the  onlv  non/ero  element  1  at  its  /th  posi¬ 
tion.  Therefore,  the  equivalent  feedback  matrix  corre¬ 
sponding  to  the  time-variant  state  transformation  obtained 
in  Theorem  1  is  a  diagonal  matrix  with  the  diagonal 
elements  of  the  value  0  or  1 

Rh-hunox 

!  1 1  N  (  Huang  and  J  k  Aggarwjl  On  linear  shift  variant  digital 

titters  //  /  /  f ran\  (  irtuns  Si »/  «i|  (  AS  2"V  pp  ^2  6"N.  Aug 

I  vxi  t 

[2\  /  J  NTkohc  A  recursive  time  van me  bandpass  filter/'  {«•## 

ph\;h\.  vol  4l),  pp  52tt  S2b.  June  !v"s 
p)  R  A  Stein.  "Design  of  recursive  Jig'tul  fillers  with  conlinuouslv 
variable  passbands."  Prm  II  PI  b'L  STwi p  .-n  (  iriuits  and  Si' 
tt’niK  (  hicugo.  II..  pp  424  42**  Apr  lvxl 
i 4 i  f  S  l.e«'u  and  J  k  Ag^arwai. '  Res ursoe  implenu ntatmn  of  I  TN 
tillers  1  ro/en  tune  transfer  function  versus  generalized  transfer 
function  l*r  >,  II  I  I  '■  ]  72  pp  vhu  VM .  Julv  1VK4 

A;  J  B  i  ru/  <  >n  the  reuli/uhihrv  of  linear  differential  svsiems."  IHI 

{  r  J h> .  i  •  I  (  T  "*  pr  '4’’  Sept  I'M) 

t  1  A  /adeh  It  tin  >  jr\  :tg  netvs.  ik.  J  Pn.t  /HI  vol  M.  pp 
1  4\ '  |MiV  (Ki  J wr  1 

H  i> 'Angelo.  /  nta-  ../<"•  4ua/.  and  \\ntht'\r> 

Hos*.  in  M  \  Allvn  and  Bason  .  * 

■V  S  Oreniei  Lnu.  xlepende:-.’  ARM  S  mode  I  ng  of  nonstationurv 

'i.nal'  II  I  I  f  rx,r  I  Spt,  h  and  S.^nai  /V,.,  cwim*  .<1 

ASSB  ' !  pi  s'**  .  \i.t  i  ex ; 

V;  I  R  Kur*  r.er  ~,nd  M  (n'ki,  and  4np;u  a:i*<n  ’  Ihn-tal  Signal 

1  , '\unK>  Inglewood  (  liffs  NJ  Brontue  Hall  1*Dk 

,1-1  M  S  \N  j.  T  ransftirtiiatn»n  of  a  hr>.  ti  mik  rvme  'V'tetn  into  a 
. ; in  jj  time  invariant  sv  stem  In  I  i  nr*  v,.l  2*  no  4.  pp 

v*m  m>2  i 

ll’  N  (  Hu  in.  and  J  k  Aggarw.it.  S.nihi'is  of  recursive  hneir 
'hit;  . arrant  d.-.tjl  Titers"  It  I  I  fr..n*  <  '  an  Sn/twi.  v..| 

<  AS  ’■».  rp  2'i  V.  J.»n  I*** 

l.  SR  Seirie  \lair  i  lljt.r,  l  ,  ’  ,i  n.>  S  /. .  New  S.-rk 

'A  .le  1  ‘-'M 

,i'j  R  Metehvt  and  SI  I  1)  Po'Aell  \  rapid  vonvergent  descent 
mefhtHl  for  insntni:;at:**n  ‘  <  >  mpa-  •  /  vo!  ^  no  2  pp  1M  Inx 
1  siM 


f/ong-Veu  Leon  (S’Klj  was  h«*m  tn  Taiwan  on  Januar'  lJ.  1^5f  lie 
received  the  B  S  and  M  S  degrees  in  electrical  engineer  n,  from  Nation. »i 
Taiwan  Lniversitv.  Taipei.  Taiwan,  tn  1^",‘*  and  l‘*xo.  r.  ectivelv 


I  mm  (,.  I^KO  he  worked  as  a  Teaching 
Assistafi>  at  Nulumal  Taiwan  L  n;vervt\  Taipe 
Taiwan  from  lVWi  tt*  14X2  he  was  a  Teaching 
Assistant  at  the  Department  of  I  lectrual  anti 
(  omputer  Lngin.enng  ai  the  l  nivtrsiK  of  Tex.i* 
jt  Austin  Aits'  n  Texas  Sinvc  l^M  he  ti.o  *'t 
j  Research  Ass  -tjni  at  the  l  iboratoiv  ft»r  Iniai-. 
and  Signal  Analysis  at  the  l  mversilv  .*1  leva--  i! 
Austin.  Austin  Texas  His  main  reseau  h  nt«  i 
e sf s  are  in  digital  signal  prtH.essing  filti  r  dvs  ^i. 
and  svsteni  the««rv 


fL 


gimiW  tl'*'2r 

I  ngineer-ng  anvl  (  .  mputcr  Sc  tenet  r 
Austin  I  utther  lie  was  a  v  .siting  a>\. stall 


J  k.  Aggarwal  (S'62  M‘f.5  SM  ^4  l  '  V(  r 
eeived  the  B  S  ;n  mathematics  and  phvsivs  fr.-i  - 
the  Universitv  of  Bonihav  <  1  )  the  B  In. 

from  the  L  niversitv  of  Liverpool,  i  ngland  1 1  Vt*  < 
and  the  M  S  and  Bh  I)  at  the  Lniversitv  . 
Illinois.  Urban  j  in  1V61  and  !  iispwn.  | . 

He  joined  The  l  rnversitv  nf  Texas  in  !u<4  i 
jn  Assistant  Bodes  u  and  has  sime  held  pos 
t.i  >ri'  as  Asm k  .at'  l'\  •  fessi  if  f  J  •VfiK  i  arid  Br»  »tc  vm  >• 
<l^'2t  «’  urrentl-  n.  >  the  John  I  MxkeP. 
I  ne’g  Br>-'-  ■>.  r  (  I  !■  .  tr  .  /  -n!  (  <  mpm 

uter  Sc  lertc «  r  T  fic  Inw'Mtx  of  Kvjs  a’ 
x  v, siting  a>s.xtatr  p u  -.o.  r  u'.  Brown  Lniversii. 


Brov -.dense  I 

<  I 

(  1  h  M.  ails!  a  1 

K'.t’nj.  a  ■ 

.  1 1 -  pr.-C 

'  xx '1 

j '  ;h.  1 

niver 

MIX  ol 

t  (  : !  !. 

■rn,. 

i  Betkelev  (  . 

iiif.-rn-..  ! . 

;  r  •  n  fc  j U6V 

1  'VV 

-  IK-  h. 

.  pub 

It  shed 

tunnel 

‘  His 

te>  hnwai 

r  -  ;n  M  o 

i  il  h.  x»ks 

No/, 

,  X. 

S .  w«  n 

-o  .  |  »  ' 

'2 1 

*•  caihar  S.  / 

,  o.,  v  .  * 

1  nj/i ' 

J 

-  '  i  (  . 

»>.pim  ’ 

\L  th. 

.J  .  1 

»lai! 

1  <af\  m  .  •  1 1  * 

/  •  . 

s  <•!«.  r  ... 

lw- 

V  i  a  IK 

Df" 

N»  ’Hi,  , 

.  IV- ;  l!  - 

v  iirr«  n-  f 

-as 

h  nterests  .,r 

.  mi  je¬ 

l  f.  ■ 

afief  .  i  --,f 

of!  H- 

.  W.«*  v.‘  . 

1  •.  - 

!  »he 

•  pee  mI 

ss  uc  - 

on  ;  > 

.  l.-i 

1  1  ilterirg 

..’'.afce 

-Ml.*  'll 

I  I 

\S  I: 

ahxax 

•  i-  ns 

March 

|  '» 

'  arid  *  »|;  M- 

■n. \  ir  !  T 

e  •  N  ar v  :nc  1’ 

Huger . 

li  Ll 

BAM 1 

T ' an- 

a«  1 . 

on  -  Nov eiubc  i 

l  i  V  X (  i  j  ... 

x!  editor  ■ 

•  O- 

!Wo  ■ 

.H.lHi' 

'  pc  s 

1  issue 

of 

(  «  mputcr  \  ,s 

.<  <n  (  i r up  h 

.  -  aiul  1  lit 

•  tie 

Bros  s'" 

th;  or 

M.  i..  -i  ,<  v<  ,1B. 


laiuiarv  w 

■  ,i  rials  P 


;  '*.x  ;  •  (  urrentlv  Ik  '  .m  ass-  c 
'  >•  '  ’M./'g*  * I  mi. ■/:  (  -  •/)./  t 

.  •  /Vo,  <«•..-  J  at  the  i  he 
! a ; i * 1  n '  (  otmmtUT  and  <*f  if. 
tl.s  (icncr.il  (  ba  rman  fot  th 


■nc-mber  ol  the  H  IT  Transnalio:-.;)  Relation'  <  otnmiUcc  aiivl  of  if 
i  d. tonal  B.-ar.’  of  UTT  Bre>s  lb-  w.  tl.s  (iencr.il  (  ba  rman  for  th 
II  LT  (  .mpi.ter  Soc.etx  (  (inference  or.  I’auern  Recognition  and  Intake 
Brocess.riu  i  Kt'.L.  s  ptsi  and  was  ihe  [  .  og:  afn  (  hatrriian  (<>r  th:  I  ;r  ’ 
(  onterciice  'ti  Artific.al  IntoHiger.c  Applications  sponsored  b\  H  IT 
<  oinputet  S-  v  etv  and  AAAI  held  :n  D/nvu  1VK4 

Dr  Aggirwai  :s  ..  member  of  A*  M  \A  ci  The  Intcrnat.onal  Socielx 
for  Opfcjl  B.ng  ncer.ng.  Pattern  Rcs<  gmtiop  Soeietv  a  kI  I  ta  kappa  NT. 


•.  %  .  *  \  *.  *.  *,  *.  *.  /.  %  •.  -W  -*  '  ■,  .  -  »  -S  ,  * 

-  *•  *  »  'V7-  \ -  .  *  "  .  /  •  * **" O'  '  *2  • 


^  T*  S 

s 


Hepri  ntrd  from  Jolr.su.  ok  iHt  Electrochemical  Society 
Vol  UJ.  No  2.  FVbmar>  1986 
Printed  in  I  SA 
Cop\  right  1986 


Temperature  Response  of  GaAs  in  a  Rapid  Thermal  Annealing 
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Rapid  thermal  annealing  (RTA)  systems  are  used  to 
heat  semiconductors  for  short  periods  of  time,  usually 
seconds.  This  technique  has  a  variety  of  applications  in 
the  processing  of  Si  and  III-V  compound  semiconductors 
(1).  In  an  RTA  system  which  uses  radiant  energy  to  heat 
the  semiconductors,  one  might  expect  to  find  variations 
in  the  temperature  response  of  materials  due  to  their  dif¬ 
ferent  absorption  properties.  An  investigation  of  GaAs 
and  Si  shows  that  this  is  indeed  the  case,  and  that  the 
temperature  \ariation  is  quite  pronounced. 


sponse.  Comparison  of  doped  vs.  semi- insulating  GaAs  in 
Fig.  2  shows  the  tremendous  effect  of  free  carrier  concen¬ 
tration  on  the  temperature  response,  an  effect  also  found 
in  silicon  by  Seidel  et  al.  (5).  However  this  is  not  enough 
to  explain  the  data  completely.  At  750“C  the  free  carrier 
concentration  of  Si  is  greater  than  that  of  the  doped 
GaAs,  yet  GaAs  is  at  a  higher  temperature  This  effect 
must  be  due  to  basic  differences  in  the  bulk  material 
properties,  for  example  direct  vs.  indirect  bandgap,  size  of 


Experimental 

The  RTA  system  used  in  these  experiments  is  similar  to 
that  described  by  Davies  and  Kennedy  (2),  and  consists  of 
a  heating  chamber  containing  two  elliptically  shaped  cav¬ 
ities  which  focus  light  produced  by  two  2  kW  tungsten 
filament  quartz  lamps  onto  a  sample  suspended  at  the 
center  of  the  chamber.  A  microprocessor  controls  power 
to  the  lamps  with  a  thermocouple  for  temperature  feed¬ 
back.  Samples  are  suspended  on  very  thin  (-250  nm 
thick)  silica  slides  inside  a  silica  tube  which  allows  an¬ 
nealing  in  a  selected  gas  ambient. 

Semi-insulating  (100)  GaAs  doped  with  Cr,  p-type  (100) 
GaAs  doped  with  7  *  10'7  cm-'  Zn,  and  n-type  (100)  Si 
doped  with  4  x  10"  P  were  cut  into  samples  -8  mm  x  8 
mm  Holes  were  etched  into  the  GaAs  using  a  20%  bro¬ 
mine-methanol  etch  and  a  mask  of  CVD  deposited  SiCX, 
and  into  the  Si  using  a  solution  of  pyrocatechol,  ethylene 
diamine,  and  water  (3)  200  nm  of  CVD  SiO,  containing 
7%  P  was  then  deposited  on  each  sample.  K-type  thermo¬ 
couples  (0.002  in.  diam  wire)  were  glued  into  the  holes 
with  a  mixture  of  Aremco  no.  516  cement  (ZrO)  and  either 
ground  GaAs  or  Si  using  a  procedure  similar  to  that  sug¬ 
gested  by  Cohen  et  al.  (4).  Such  small  thermocouples 
were  used  to  insure  that  the  temperature  response  of  the 
samples  would  be  little  affected  by  the  thermocouples. 

Throughout  the  experiments,  the  GaAs  samples  were 
either  thermally  isolated  from  each  other  or  placed  on  a 
larger  piece  of  silicon  to  thermally  connect  them,  as 
shown  in  Fig.  1.  Temperature  feedback  to  the  controller 
was  provided  by  the  Si  sample.  These  two  configurations 
were  then  submitted  to  a  heat  cycle  consisting  of  a  pre¬ 
heat  at  300°C  for  30s  to  stabilize  initial  conditions,  then  an 
instantaneous  change  of  setpoint  to  750°C,  a  hold  at  this 
temperature  for  30s  (counted  after  the  sample  is  within 
30"C  of  750°C),  and  then  a  change  of  the  setpoint  back  to 
zero.  The  heat  cycling  was  done  in  a  stagnant  N,  atmo¬ 
sphere 


Results  and  Discussion 

Temperature  response  curves  for  the  thermally  isolated 
samples  are  shown  in  Fig.  2.  Curve  (a)  is  the  programmed 
setpoint  and  curve  (b)  is  the  thermocouple  output  of  the 
Si  sample.  The  Zn  doped  and  semi-insulating  GaAs 
samples  are  shown  as  curve  (c)  and  (d),  respectively 
Clearly,  GaAs  couples  to  the  radiant  flux  differently 
than  Si,  exhibiting  a  drastically  different  temperature  re- 
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Fig.  1 .  Schemotic  diagram  of  tampla  configuration  for  (a)  Hiermolly 
iioloted  and  (b)  thermoMy  connactad  com 


Fig  2  Temperature  response  euro*  for  tfiermoHy  isolated  samples 
(o)  tamparotura  setpoint,  (b)  t  doped  Si,  (c)  Zn  doped  GoAi,  and  (d) 
Cr  doped  GoAi. 
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Fig.  3.  Temperature  response  curve  for  thermally  connected 
samples:  (o)  temperature  setpoint,  (b)  P  doped  Si,  (c)  In  doped  GaAs, 
and  (d)  Cr  doped  GaAs. 


bandgap,  optical  absorption  parameters,  and  thermal  con¬ 
ductivity.  Theoretical  work  by  Borisenko  et  al.  (6)  pre¬ 
dicts  different  heating  rates  for  materials  with  different 
emissivities.  Another  factor  may  be  a  shift  in  the  fre¬ 
quency  distribution  of  the  tungsten  lamps'  output  at  dif¬ 
ferent  power  levels  (due  to  blackbody  radiation  at  differ¬ 
ent  temperatures).  As  the  lamp  power  is  cut  back  to  hold 
the  sample  at  temperature,  the  light  shifts  to  longer  wave¬ 
lengths,  which  may  affect  the  relative  amounts  of  band- 
to-band  vs.  free  carrier  excitation. 

Results  for  the  thermally  connected  samples  are  shown 
in  Fig.  3.  Curves  (a)  and  (b)  are  again  the  programmed 
setpoint  and  the  Si  sample  response,  (c)  and  (d)  the  Zn 
doped  and  semi-insulating  GaAs  samples. 

When  the  materials  are  put  in  thermal  contact,  their 
temperature  behavior  becomes  similar.  The  significant 
temperature  overshoot  of  setpoint  is  due  to  thermal  lags 
in  the  control  loop  from  the  configuration  in  Fig.  lb.  In 
actual  anneals,  the  large  piece  of  Si  pictured  in  Fig.  lb 


would  contain  the  thermocouple.  In  such  a  setup,  we 
have  observed  negligible  temperature  overshoot  and  the 
temperature  difference  between  GaAs  and  Si  is  usually 
less  than  15“C  at  750°C.  It  should  be  noted  that  this  an¬ 
nealing  arrangement  requires  good  thermal  contact  of  the 
sample  to  the  large  Si  susceptor.  We  also  found  the  tem¬ 
perature  response  to  be  sensitive  to  gas  flow  in  the  sys¬ 
tem.  A  high  flow  rate  resulted  in  a  GaAs  temperature 
50°C  higher  than  the  temperature  of  the  Si  susceptor  at 
750°C.  The  source  of  this  phenomenon  is  not  clear;  the  an¬ 
neals  were  therefore  performed  in  a  stagnant  atmosphere. 

These  results  point  to  an  important  fact:  GaAs  behaves 
quite  differently  from  Si  in  a  rapid  thermal  annealing  sys¬ 
tem.  Consequently,  the  temperature  of  the  GaAs  must  be 
used  to  control  the  system.  This  can  be  done  either  di¬ 
rectly,  by  using  a  pyrometer  looking  at  the  GaAs  or  indi¬ 
rectly,  by  having  the  GaAs  in  isothermal  contact  to  a  sus¬ 
ceptor  to  which  a  thermocouple  is  attached.  The  latter 
method  is  the  less  complicated  of  the  two  and  avoids 
problems  pyrometers  themselves  face  in  a  radiant  RTA 
system  (7). 

Acknowledgments 

The  authors  acknowledge  support  from  the  Joint  Ser¬ 
vices  Electronics  Program  under  contract  No.  AF'OSR 
F49620-82-C-0033  and  the  National  Science  Foundation 
under  grant  no.  ECS-8420002.  We  also  wish  to  thank  D. 
Neikirk  and  S.  Lester  for  useful  discussion  during  the 
course  of  the  experiments. 

Manuscript  submitted  Oct.  4,  1985. 

The  University  of  Texas  assisted  in  meeting  the  publica¬ 
tion  costs  of  this  article. 

REFERENCES 

L  J-  F.  Gibbons,  D.  M.  Dobkin,  M.  E.  Greiner,  J.  L.  Hoyt, 
and  W.  G.  Opyd,  Mater.  Res.  Soc.  Symp.  Proc.,  23,  37 
(1984). 

2.  D.  E.  Davies  and  E.  F.  Kennedy,  Electronics  Lett.,  18, 

no.  7,  282  (1982). 

3.  K.  E.  Bean,  IEEE  Trans.  Electron.  Devices,  ed-25,  1185 

(1978). 

4.  S.  A.  Cohen,  T.  O.  Sedgewick,  and  J.  L.  Speidell,  Ma¬ 

ter.  Res.  Soc.  Symp.  Proc.,  23,  321  (1984). 

5.  T.  E.  Seidel,  D.  E.  Lischner,  C.  S.  Pai,  and  S.  S.  Lau,  J. 

Appl.  Phys.,  57,  (4),  1317  (1985). 

6.  V.  E.  Borisenko,  V.  V.  Gribkovskii,  V.  A.  Labunov,  and 

S.  G.  Yudin,  Phys.  Status  Solidi  A,  86,  573  (1984). 

7.  C.  Russo,  Nucl.  Instrum.  Methods.  B6,  298  (1985). 


I 

I 


Ht-pritllt  H  tnilll  JolKWI.  UK  I  Ht  El  t<  IHIKHt'IK  M  StKIltn 
\«»l  I  U.  No  in.  (Xr.iln-I  1 9fUi 
Printed  in  l'  S  A 
t’np\  rt>>li?  IWi 


Factors  Influencing  the  Photoluminescence  Intensity  of  InP 

S.  D.  Lester,*  T.  S.  Kim,  and  B.  G.  Streetman** 

Department  of  Electrical  and  Computer  Engineering.  Microelectronics  Research  Center.  The  University  uj  Texas. 

Austin.  Texas  78712 


4H 


Q 


Room  temperature  photoluminescence  (PL)  has  recently 
been  used  as  a  tool  to  assess  the  quality  of  InP  substrates 
during  and  after  various  processing  steps  and  the  quality  of 
InP/insulator  interfaces.  It  has  generally  been  suggested  that 
high  PL  intensities  reflect  high  quality  bulk  material  or  high 
quality  interfaces.  For  example,  band  edge  PL  intensity  has 
been  used  to  assess  the  effectiveness  of  annealing  treatments  for 
activating  ion-implanted  dopants1  and  has  been  used  in  a 
number  of  studies  to  monitor  processing  steps  used  during  the 
fabrication  of  MIS  devices.-  In  the  former  studies  it  was 
suggested  that  high  PL  intensities  reflect  high  quality  annealing 
(a  bulk  property)  and  in  the  latter  studies  it  was  suggested  that 
the  PL  intensity  of  n-type  InP  yields  a  reliable  estimate  of  the 
interface  state  density  in  the  upper  part  of  the  band  gap.  In  this 
communication  we  discuss  factors  which  influence  band-edge 
PL  intensity  and  point  out  that  great  care  must  be  taken  in 
interpreting  such  data. 

Although  many  factors  are  involved  in  determining  PL 
intensity,  three  important  influencing  factors  are  bulk 
parameters  (i.e.,  mobility,  lifetimes,  doping  level,  etc.),  the 
surface  recombination  velocity  (S),  and  the  presence  of  a  space 
charge  region  (electric  field)  at  the  surface.  The  first  of  these 
reflects  bulk  crystal  quality  and  the  second  two  are  determined 
by  surface  properties.  The  relative  influence  of  these  factors  in 
determining  the  PL  intensity  of  a  given  sample  is,  in  general, 
very  difficult  to  determine;  thus,  processing-induced  changes  in 
PL  intensity  can  easily  he  misinterpreted.  We  note  that  PL 
intensity  changes  have  been  used  as  a  quantitative  measure  of 
GaAs  surface  properties. 

Although  PL  has  long  been  recognized  as  a  near-surface 
probe  of  bulk  material,  surface  effects  can  very  strongly 
influence  the  intensity  of  band-edge  luminescence.  This  surface 
sensitivity  is  dramatic  in  the  case  of  n-type  InP  where  both 
liquid  and  gas  ambients  have  been  shown  to  have  pronounced 
effects  on  PL  intensities.  In  the  case  of  n-InP  immersed  in 
chemical  solutions,  it  has  been  demonstrated,5  and  confirmed  in 
our  laboratory,  that  PL  intensities  can  be  changed  by  three 
orders  of  magnitude.  For  example,  in-situ  measurements  of 
n-InP  alternately  flushed  with  DI  water  and  dilute  HF  show  that 
the  band-edge  intensity  can  be  reversibly  varied  by  -lOOOx  in  a 
very  short  time.  Apparently,  different  chemical  treatments  also 
leave  the  InP  surface  with  varying  degrees  of  "stability"  to 
subsequent  changes  in  PL  intensity  caused  by  exposure  to  air  or 
low  temperature  annealing.5'6 


•Electrochemical  Society  Student  Member 
••Electrochemical  Socicly  Active  Member 


Another  example  of  surface  effects  on  PL  intensity  is 
the  case  of  InP  exposed  to  various  gas  ambients.  Fig.  1  shows 
the  band-edge  PL  intensity  of  an  undoped  (  n=5xI015  cm"3  ) 
sample  repeatedly  exposed  to  oxygen  and  nitrogen.  As  the 
figure  shows,  the  PL  intensity  is  reduced  in  oxygen,  increased 
in  nitrogen,  and  can  be  cycled  repeatedly.  The  size  of  these 
ambient-induced  PL  changes  are  influenced  by  many  factors, 
including  pressure  (flowrate),  substrate  doping  concentration 
and  type,  the  laser  intensity,  humidity,  and  the  history  of  the 
sample.  Some  of  these  effects  have  been  reported 
previously,7,8  and  a  more  detailed  description  will  be  presented 
elsewhere.9  We  also  note  that  GaAs  is  sensitive  to  ambient 
effects,  but  to  a  lesser  degree  than  InP.  Like  the  effects  of 
chemical  solutions,  these  ambient  effects  are  very  substantial 
(in  certain  cases  the  intensity  can  be  change  by  >5x)  and 
illustrate  the  high  surface  sensitivity  of  band  edge  PL  intensity. 
The  fact  that  PL  intensity  is  so  strongly  influenced  by  these 
surface  effects  (including  interface  effects  at  InP/dielectric 
interfaces)  indicates  that  great  care  must  be  taken  in  extracting 
bulk  information  from  PL  intensity  data.  We  also  note  that  PL 
intensities  measured  at  low  temperatures  (samples  immersed  in 
liquid  He)  are  sensitive  to  sample  surface  properties  prior  to 
cooling.  To  safely  compare  the  PL  intensities  of  a  number  of 
samples  it  is  therefore  important  to  insure  that  the  samples 
have  nearly  identical  surfaces. 


Fig.  1.  Band-edge  PL  response  of  n=5xl015  cm'1  InP  to 
ambient  changes. 


Another  notable  feature  of  Fig.  I  is  the  gradual 
reduction  of  PL  intensity.  This  slow  trend  is  at  least  partially 
reversible  and  has  previously  been  attributed  to  oxidation.7  We 
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have  found  that  this  slow  trend  is  a  direct  result  of  the 
illumination  process  and  can  substantially  reduce  the  PL 
intensity  under  high  laser  power  levels.  Therefore,  it  should  be 
recognized  that  the  illumination  process  itself  can  markedly 
alter  the  surface  properties  and  consequently  the  PL  intensity  of 
InP,  so  care  should  be  taken  to  account  for  it  when  taking  room 
temperature  PL  data  (i.e„  samples  should  receive  comparable 
amounts  of  laser  excitation  prior  to  the  start  of  the 
measurement). 

The  previous  examples  have  shown  that  surface 
properties  have  an  important  influence  on  PL  intensity  and  need 
to  be  considered  if  bulk  information  is  to  be  extracted  from  PL 
intensity  data.  On  the  other  hand,  these  surface  effects  can  be 
used  as  a  tool  for  studying  InP  surfaces  and  interfaces.  If 
surface  information  is  to  be  obtained  it  is  important  to 
distinguish  between  the  effects  of  band  bending  changes  and 
surface  recombination  velocity  changes  on  measured  PL 
intensities.  As  suggested  by  Aspnes,10  to  do  this,  it  is 
extremely  helpful  to  have  a  second  measurement  technique 
which  can  give  an  independent  measure  of  the  surface  Fermi 
level  (Hps).  Possible  techniques  for  this  include  Raman  or 
photoemission  spectroscopy  or  the  use  of  PL  with  two 
excitation  wavelengths;  however,  the  simplest  example  of  such 
a  technique  is  to  measure  the  resistance  of  a  thin  film  resistor 
which  will  have  a  resistivity  ihat  is  a  function  of  the  depth  of 
the  space  charge  region  at  the  surface. 

Figure  2  shows  the  resistivity  of  an  n-type  resistor 
structure  in  oxygen  and  nitrogen.  The  structure  was  made  by 
implanting  Si  (1012  cm_i  @  150  keV)  into  a  '3  x  5  mm 
InP.Fe  sample  and  alloying  livSn  ohmic  contacts.  Figure  2: 
shows  the  resistance  under  illumination  and  Fig.  2b  shows  the 
resistance  in  complete  darkness.  It  is  clear  from  these  figures 
that  the  resistance  of  this  structure,  like  the  PL  intensity,  is 
reversibly  changed  with  ambient.  Since  the  resistance 
under  illumination  depends  on  the  surface  recombination 
velocity,  Fig.  2a  is  not  sufficient  to  indicate  wither  a  change 


Fig.  2  Resistivity  response  of  an  n-type  resistor  (a)  under 
illumination  and,  (b)  in  complete  darkness. 


in  s  or  a  change  in  band  bending  is  responsible  for  the  PL 
intensity  and  resistance  changes.  However,  Fig.  2b  clearly 
indicates  that  ambient  changes  result  in  changes  in  the  surface 
Fermi  level  (depletion  depth).  This  change  in  Eps  and  the 
depletion  depth  then  results  in  the  PL  intensity  response 
shown  in  Fig.  1.  Such  PL  intensity  variations  have 
previously  been  attributed  solely  to  changes  in  S.7  Our  d  ta, 
of  course,  does  not  exclude  a  change  in  S,  and  in  fact  S  should 
be  a  function  of  Fermi  level  position.  However,  this  does 
demonstrate  that  the  surface  Fermi  level  is  affected  by  ambient 
changes  and  that  changes  in  Eps  must  be  considered  when 
interpreting  PL  intensity  variations. 

In  summary,  room  temperature  PL  can  be  an  extremely 
useful  technique  for  investigating  bulk  and  surface  properties  of 
InP.  However,  since  the  intensity  of  the  band-edge  transition  is 
strongly  affected  by  a  number  of  factors,  great  care  must  be 
taken  in  interpreting  intensity  data.  If  bulk  information  is  to 
be  obtained,  samples  must  be  prepared  with  identical  surfaces 
and  if  meaningful  surface  information  is  sought,  a  second 
measurement  should  be  used  to  separate  the  effects  of  band 
bending  from  the  effects  of  changes  in  the  surface 
recombination  velocity. 
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Experimental  observation  of  adsorbate  orbital  splitting  at  single-crystal  metal  surfaces 
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Splitting  of  the  j  component  of  the  photoexcited  Xe  ion  doublet  is  observed  on  the  (110)  planes  of 
several  metal  surfaces  This  effect  is  shown  to  originate  from  a  true  “crystal-field”  effect,  not  as  a  conse¬ 
quence  of  adatom-adatom  interactions  The  splitting  therefore  provides  a  probe  of  local  fields  at  the 
screened  ion 


One  of  the  more  important  parameters  associated  with 
surface  phenomena  is  the  local  potential  Recent  efforts  to 
more  thoroughly  understand  local  surface  potentials  include 
calculations  of  the  orbital  splitting  of  atoms  approaching  a 
metal  surface,1  studies  of  dipole  moments  and  polarizabili¬ 
ties  of  adsorbed  atoms2  (both  ground-state  effects),  and 
analysis  of  charge  transfer  and  screening  effects  at  metal 
surfaces  which  accompany  various  photon  and  electron  exci¬ 
tation  phenomena  One  particular  focus  of  work  on  this 
problem  has  involved  excitation  properties  of  rare-gas 
atoms'"1'  on  metal  surfaces,  i.e..  systems  which  have  well- 
defined  ground  states. 

Waclawski  and  Herbst6  conducted  one  of  the  first  photo- 
emisston  investigations  of  a  rare-gas  monolayer  on  a  metal 
surface.  They  reported  significant  broadening  of  the  SPy2 
component  of  the  spin-orbit  spin  5p  level  of  Xe  physisorbed 
on  WI100),  and  attributed  the  broadening  to  unresolved 
vplitting  resulting  from  the  surface  crystal  field.  This  inter¬ 
pretation  stimulated  several  model  calculations"9  and  addi¬ 
tional  experiments10  aimed  at  testing  the  hypothesis  of  sur¬ 
face  crystal  fields  in  more  detail.  These  calculations,  and 
subsequent  experiments,10’ 12  which  were  conducted  under 
more  carefully  controlled  conditions,  have  shown  that  the 
broadening  of  the  >Py2  line  observed  by  Waclawski  and 
Herbst  at  full  monolayer  coverages  was  not  due  to  crystal- 
field  or  image  charge  effects  8  In  this  case,  the  line 
broadening  resulted  from  Xe-Xe  interactions  as  shown  by 
angle-resolved  photoemission  determination  of  the  Xe-band 
dispersion  throughout  the  surface  Brillouin  zone  and  com¬ 
parison  with  results  of  simple  tight-binding  calculations. 

More  recent  experiments  by  Opila  and  Gomer13  have 
again  raised  the  issue  of  surface  image  dipole  or  crystal-field 
effects  in  relation  to  the  photoemission  spectra  of  phy¬ 
sisorbed  rare-gas  atoms  on  meta!  surfaces.  In  these  careful¬ 
ly  conducted  experiments  a  very  convincing  case  is  present¬ 
ed  in  support  of  the  existence  of  a  mechanism,  unrelated  to 
adatom-adatom  interactions,  which  splits  the  SPy2  line  of 
rare-gas  atoms  on  W ( 1 10).  This  result  is  in  contrast  to  the 
null  result  obtained  by  Erskine10  under  similar  experimental 
conditions  for  Xe  on  W(100) 

We  have  recently  conducted  extensive  photoemission 
studies  of  physisorbed  rare-gas  atoms  on  single  crystal  NiAl 
alloy  surfaces14  to  investigate  surface  stoichiometry  and  local 
work  functions  as  probed  by  the  photoemission  of  adsorbed 
xenon  (PAX)  technique  |S  During  this  study  we  observed 
splitting  of  the  5 PV2  component  of  photoexcited  Xe  atoms 
on  the  NiAI(llO)  surface,  but  not  on  the  (100)  surface  of 
the  same  ordered  alloy  The  splitting  was  observed  using 
experimental  conditions  under  which  Xe-Xc  interactions  are 


negligible  (i.e.,  low  coverages),  leading  to  the  conclusion 
that  the  splitting  is  similar  in  nature  to  that  reported  by  Opi¬ 
la  and  Gomer.  We  have  conducted  additional  experiments 
on  Nil  110)  and  Nil  100)  surfaces  which  yield  the  same 
result,  i.e..  that  the  lower-symmetry  surface  produces  split¬ 
ting  of  the  SPy2  line.  The  purpose  of  the  present  Brief  Re¬ 
port  is  to  present  these  results  which  suggest  that  splitting 
of  the  7—  y  component  of  photoexcited  Xe  atoms  phy¬ 
sisorbed  at  low  coverage  on  (110)  surfaces  results  from  the 
lower  coordination  symmetry  of  the  adsorption  site. 

Experiments  reported  here  were  conducted  using  an 
Auger-photoelectron  spectrometer10  equipped  with  low- 
energy  electron  diffraction  (LEED)  optics  and  a  cold  stage 
manipulator  capable  of  sample  temperatures  ranging  from 
1200  to  below  30  K  The  3- -in -diam*  -r- -in. -thick 
NiAI(llO)  samples  were  aligned  to  ±1°  using  x-ray  Laue 
techniques,  and  spark  cut  and  mechanically  polished  using 
alumina  powder  to  0.05-jzm  grit.  In  situ  cleaning  using  Ne 
ion  sputtering  (500  eV.  10  ^A/crn2)  and  annealing  to 
800  °C  yielded  clean,  well-ordered  surfaces.  Auger  analysis 
of  the  clean  surfaces,  our  work14  in  which  work-function 
changes  were  studied,  and  chemisorption  experiments16  in¬ 
volving  CO  indicate  that  the  well-annealed  NiAI(llO)  sur¬ 
faces  exhibit  a  stoichiometry  (ratio  of  Ni  to  Al)  equal  to  the 
bulk  value  (i.e..  1  to  1).  Recent  LEED  studies  of  this  sur¬ 
face  suggest  a  reconstruction  involving  atomic  rippling.1 
which  consists  of  small  displacement  of  the  surface  atoms 
(  —0  08  A)  perpendicular  to  the  surface.  Extensive  angle- 
resolved  photoemission  studies18  of  the  NiAKllO)  surface 
using  synchrotron  radiation  have  yielded  bulk  band  struc¬ 
ture  in  good  agreement  with  calculations.  These  experi¬ 
ments  constitute  additional  characterization  of  the 
NiAK  1 10)  surface. 

Our  interpretation  of  the  splitting  of  the  5/Y:  peak  in 
terms  of  Iocal-substrate-related  fields  rather  than  adsorbate- 
adsorbate  coupling  relies  on  accurate  knowledge  of  the  sur¬ 
face  conditions,  including  substrate  order  and  composition, 
which  was  just  discussed,  as  well  as  adatom  coverage  and 
spatial  distributions.  Adatom  concentration  was  accurately 
calibrated  at  integral  monolayer  coverages  by  analysis  of 
multipeak  spectra  resulting  from  layer-dependent  XOO 
Auger  energy  shifts,1"  and  checked  using  5 P  valence  level 
shifts  observed  in  photoemission  spectra.  Uniform  Xe 
layers  of  n  =  1 .  2.  and  3  monolayers  could  be  obtained  by 
adsorption  followed  by  carefully  monitored  annealing  Sub- 
monolayer  coverages  were  determined  from  work-function 
changes,  which  are  roughly  linear  in  the  0-0  5-ML  range, 
and  from  the  intensity  of  the  angle-integrated  photoemis¬ 
sion  peaks  relative  to  the  NiAl  </  bands  measured  in  the 
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same  configuration  used  in  calibration  experiments.  Sample 
temperatures  during  and  after  adsorption  were  maintained 
below  40  K  to  ensure  low  surface  mobility  of  the  adsorbed 
rare  gases.  No  evidence  of  island  or  cluster  formation  of  Xe 
atoms  was  observed  in  photoemission  (i.e.,  k«  dispersion  of 
peaks)  or  by  LEED  analysis  at  these  temperatures.  Weak 
halos  having  sixfold  symmetry  were  observed  in  LEED 
studies  of  low  coverage  Xe  films  only  after  annealing  to 
—  100  K,  indicating  that  temperatures  in  this  range  are  re¬ 
quired  to  induce  island  formation. 

Figure  1  displays  angle-resolved  electron  energy  distribu¬ 
tion  curves  (EDC’s)  for  various  coverages  of  physisorbed 
Xe  on  NiAI(llO)  at  T  —  30  K.  Various  features18  of  the 
d  bands  of  NiAl  along  the  A  direction  of  the  three- 
dimensional  Brillouin  zone  are  apparent  in  the  energy  range 
within  5  eV  of  the  Fermi  energy,  £>.  Submonolayer  Xe 
spectra  exhibit  two  primary  peaks  corresponding  to  the 
5P\n,  SPy/2  states  of  the  ion.  Spectra  for  coverages  greater 
than  one  monolayer  exhibit  two  additional  peaks  which  in¬ 
crease  in  strength  with  coverage.  These  are  the  second 
layer  peaks,  which  are  shifted  to  higher  binding  energy  due 
to  less  effective  screening  of  the  ion  (relaxation  shifts),  and 
which  were  used  in  thickness  calibration. 

Close  inspection  of  the  5 /^-derived  peaks  corresponding 
to  low  coverage  reveals  that  it  is  not  symmetric  as  is  the 
5Pi/2  peak  at  equal  coverage.  Figure  2  illustrates,  on  an  ex¬ 
panded  scale,  the  two  peaks  for  equal  coverages  of  Xe  on 
NiAl(llO)  and  on  NiAl(lOO).  One  does  not  expect  to  be 
able  to  resolve  the  actual  crystal-field  splitting  of  the  P}/2 
level  because  the  intrinsic  broadening  of  the  lines  due  to  re¬ 
laxation  mechanisms  related  to  the  presence  of  the  metal 
surface  is  approximately  equal  to  the  splitting.  The  inset  of 
Fig.  2  compares  results  of  curve  fitting  the  two  EDC’s  using 
three  Gaussian  functions,  assuming  that  the  5P}/2  com- 


FIG  I  Angle-resolved  photoemission  spectra  for  Xe  adsorbed 
on  NiAKI  10)  as  a  function  of  Xe  coverage. 


poncnt  in  each  case  is  composed  of  two  Gaussians  having 
the  same  width  as  the  5PV2  component.  This  analysis 
shows  that  splitting  of  the  5P)/2  line  is  approximately  0.35 
eV. 

There  are  no  detectable  shifts  in  peak  positions  as  a  func¬ 
tion  of  Xe  coverage  at  low  coverages.  This  coverage  in¬ 
dependence  suggests  that  splitting  of  the  5P}/2  line  results 
from  coupling  between  the  screened  ion  and  the  substrate 
rather  than  with  neighboring  Xe  atoms.  The  annealing  ex¬ 
periments  which  established  the  temperature  at  which  is¬ 
lands  did  form  also  eliminated  the  possibility  that  the  in¬ 
dependence  of  binding  energy  with  coverage  resulted  from 
islands  or  clusters  of  constant  density  at  all  coverage. 

We  have  carried  out  corresponding  experiments  involving 
low  coverage  Xe  layers  on  Ni(lOO)  and  Ni ( 1 10) .  These  ex¬ 
periments  yield  a  similar  splitting  of  the  SP^  peak  for  Xe 
on  the  Ni(110)  surface  but  not  on  the  Ni(lOO)  surface 
Close  inspection  of  experimental  results  of  Jacobi  and 
Rotermund,19  which  were  obtained  under  similar  experi¬ 
mental  conditions,  also  reveals  a  split  SP}/2  level  for  low 
coverages  of  Xe  on  Nit  1 10) ,  in  agreement  with  our  results. 
Our  previous  search  for  crystal-field  splitting  on  the  W(100) 
surface  revealed  no  splitting  of  the  SPj/j  level,10  but  photo¬ 
emission  studies  of  Xe  of  Opila  and  Gomer  on  the  W (110) 
surface  exhibit  splitting  of  this  level.19  Based  on  these  ex¬ 
perimental  data.7  it  appears  that  the  splitting  of  the  5 Pin 
state  could  be  related  to  the  reduced  symmetry  of  the  ad¬ 
sorption  site  on  the  (110)  surfaces. 

To  validate  this  possibility,  one  must  be  convinced  that 
the  (110)  and  the  (100)  faces  of  W,  Ni,  and  B-NiAl  can  and 
probably  do  yield  qualitatively  different  local  environments 


FIG.  2.  Expanded  scale  of  angle-resolved  photoemission  spectra 
for  Xe  adsorbed  on  NiAKI  10)  and  Ni  Alt  100)  surfaces.  Inset,  line 
intensities  as  determined  by  Gaussian  fitting  of  the  data  Values  of 
the  splitting  are  Xe  on  NiAKI  10),  A-  1.22  ±004  eV,  A'-0.37  ±04 
eV,  Xe  on  NiAI(lOO),  A  -  1  20  ±  0  04  eV.  A'-0  06  ±0.06  eV 
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TABLE  I.  Polar-angle  dependence  of  5/>1/I  and  SPy2  linewidths  of  photoemission  spectra  for  Xe  adsorbed 
on  NiAI(llO).  Photon  energy  hv- 21.22  eV,  sample  temperature  T~-  30  K  Angle  is  measured  from  the 
sample  normal  along  the  1 110]  direction;  center  and  width  energies  are  in  eV  Angular  resolution 
±4°. 


"*1/2 

Polar 

Center 

Width 

Center 

Width 

angle 

(below  tF) 

(below  tf  ) 

0“ 

7.05 

0.54 

5  83 

0  75 

10’ 

7.05 

0.53 

583 

0  7  0 

20° 

7.05 

0.52 

5  83 

067 

30° 

7.05 

0.50 

5.83 

063 

40° 

7.05 

0.50 

5.83 

063 

for  adsorbed  Xe  For  physisorbed  atoms,  at  low  coverage, 
the  preferred  site  will  most  likely  be  the  deepest  hollow  sites 
in  a  surface  unit  cell.  For  bcc  W(100),  Xe  should  physisorb 
at  the  C4v  fourfold  hollow  site,  for  W(110)  the  correspond¬ 
ing  site  has  C2v  symmetry.  In  the  case  of  fee  Ni,  again,  the 
preferred  surface  site  on  the  (100)  face  will  have  C4v  sym¬ 
metry,  and  C2u  symmetry  on  the  (110)  face.  The  crystal 
structure  of  )3-NiAl  is  the  CaF2  (cubic)  structure.  Here 
again,  the  (100)  surface  offers  only  C4v  sites,  whereas  the 
(110)  surface  offers  C2u  sites.  Qualitatively,  the  argument 
of  a  symmetry  based  origin  of  the  splitting  appears  valid. 

Herbst’  has  investigated  theoretically  the  angular  depen¬ 
dence  of  photoelectrons  from  atoms  adsorbed  on  metal  sur¬ 
faces,  taking  into  account  the  effects  of  the  substrate  atoms. 
Although  none  of  the  specific  results  obtained  by  Herbst  ap¬ 
ply  directly  to  Xe  adsorbed  in  the  C2v  site  on  NiAl,  one  of 
the  general  characteristics  of  the  model  should  apply.  This 
characteristic  is  that  the  polar-angle  variation  of  photoelec¬ 
tron  emission  associated  with  component  lines  of  a  crystal- 


field  split  level  will  be  different 
Table  1  illustrates  the  experimentally  determined  polar  an¬ 
gle  dependence  of  the  5 Py2,  5 P}/2  linewidths  for  Xe  on 
NiAI(llO).  The  intrinsic  broadening  is  too  large  to  clearly 
resolve  the  SPy2  state  splitting  (as  shown  in  Fig.  2).  How¬ 
ever,  two  features  are  clear  from  our  polar-angle-variation 
data.  First,  the  binding  energies  of  the  SPV2  and  5 P}/2 
states  are  independent  of  polar  angle,  confirming  that  lateral 
interactions  (which  would  produce  band  dispersion)  are  not 
present.  Second,  the  5Py2  linewidth  is  nearly  constant, 
whereas  the  SPy2  linewidth  changes  significantly,  as  would 
be  expected  if  the  intensity  ratio  of  the  component  lines 
changed  This  constitutes  additional  evidence  of  a  local 
crystal-field  origin  of  the  splitting. 
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Results  of  a  combined  experimental-theoretical  study  of  the  electronic  structures  and  properties  of  epitax¬ 
ial  Fe  on  CuUOO)  are  reported.  Angle-resolved  photoelectron  spectroscopy  is  used  to  determine  the  elec¬ 
tronic  structure  of  one,  two,  and  four  layers  of  epitaxial  Fe  on  Cu(100).  The  experimentally  determined 
two-dimensional  energy  bands  of  p(  1  x  1  )Fe  monolayers  and  bilayers  verify  predictions  of  local  spin-density 
full-potential  linearized  augmented  plane-wave  calculations.  Changes  in  electronic  properties  with  coverage 
and  the  evolution  of  the  bulk  electronic  structure  of  the  substrate-stabilized  fee  iron  are  described. 


Advances  in  (i)  novel  sample  preparation  techniques,  in 
particular  epitaxial  growth  of  layered  structures,  and  (ii)  in 
local  spin-density  electronic-structure  theory  of  surfaces  and 
interfaces  are  stimulating  new  interest  and  excitement  in  the 
field  of  thin-film  magnetism.  Taken  together,  they  offer 
unique  opportunities  for  developing  new  magnetic  materials' 
as  well  as  advancing  our  understanding  of  fundamental 
magnetic  interactions  in  solids.2  The  opportunities  for  ad¬ 
vancing  our  fundamental  understanding  of  magnetic  phe¬ 
nomena  are  particularly  attractive  in  the  subfield  of  solid- 
state  physics  in  which  carefully  characterized  materials  of 
known  structure  are  experimentally  studied  and  the  results 
coordinated  with  first-principles  calculations. 

In  a  recent  publication3  we  reported  experimental  results 
for  epitaxial  /><lxl)Ni  films  on  Cu(lOO).  This  study 
presented  some  of  the  first  detailed  experimental  evidence 
showing  that  high-quality  epitaxial  magnetic  films  could  be 
grown  on  metallic  single-crystal  surfaces,*  and  that  the  two- 
dimensional  electronic  structure  and  magnetic  exchange 
splitting  of  these  films  could  be  determined  with  sufficient 
accuracy  to  provide  meaningful  tests  of  the  predictions  of 
first-principles  calculations.3 

The  present  Rapid  Communication  continues  to  explore 
the  prospects  of  advancing  our  understanding  of  two-di¬ 
mensional  magnetic  structures  based  on  the  interplay 
between  ab  initio  first-principles  calculations  and  photoemis- 
sion  studies  of  novel  thin-film  structures  fabricated  by 
molecular-beam  epitaxy.  Our  experimental  results  confirm 
the  predictions  of  the  computational  studies,3  and  indicate 
that  the  p(  1  x  1  )Fe  on  Cu(lOO)  system  is  a  second  suitable 
candidate  for  detailed  experiments  in  which  electronic  and 
magnetic  properties  can  be  probed  by  the  rapidly  increasing 
number  of  surface  and  spin  sensitive  spectroscopic  tech¬ 
niques 6 

Our  experiments  were  performed  at  the  Synchrotron  Ra¬ 
diation  Facility  in  Stoughton,  Wisconsin.  The  1-m  stain¬ 
less-steel  Seya-Namioka  monochrometer  was  used  to 
dispense  radiation  from  the  Tantalus  storage  ring,  and  an 


angle-resolving  photoelectron  spectrometer,  described  previ¬ 
ously,3,7  was  used  to  prepare  the  epitaxial  crystals  and  to  ob¬ 
tain  the  photoelectron  spectra.  Our  sample  preparation 
techniques  and  surface  characterization  methods  were  also 
identical  to  those  described  previously.3 

The  theoretical  electronic  structures  were  determined 
from  local  spin-density  functional  theory  by  means  of  the 
highly  precise  all-electron  full-potential  linearized  augment¬ 
ed  plane-wave  (FLAPW)  method.8  The  surfaces  are  mod¬ 
eled  by  a  single-slab  geometry  with  Fe  layer(s)  atop  a  five- 
layer  Cu(001)  substrate:  The  stacking  has  atoms  in  the 
fourfold  hollow  site  of  adjacent  atomic  planes.  The  Fe-Cu 
interlayer  spacing  was  determined  from  total  energy  calcula¬ 
tions.  We  find  that  Fe  forms  an  ordered  overlayer  on 
Cu(001)  with  an  fe-Cu  interlayer  spacing  which  is  very 
close  (within  0  05  A)  to  that  of  the  substrate.  This  result  is 
confirmed  by  low-energy  electron-diffraction  (LEED)  stud¬ 
ies  described  later.  For  the  case  of  two  monolayers 
p(  1  x  l  )Fe/Cu(001 ),  the  problem  is  complicated  by  the 
magnetic  coupling  between  the  Fe  layers.  The  magnetic 
ground  state  was  therefore  determined  from  a  comparison 
of  spin-polarization  energy  between  various  magnetic  states. 
The  ferromagnetic  coupled  bilayers  are  found  to  have  the 
lowest  total  energy,  whereas  the  antiferromagnetic  coupling 
between  the  Fe  layers  exists  as  a  metastable  state  with  a  to¬ 
tal  energy  0.2  eV  above  the  ferromagnetic  state. 

Despite  the  presence  of  the  nonmagnetic  Cu  substrate, 
strongly  enhanced  magnetic  moments  localized  at  the  Fe 
site  are  found  from  these  calculations:  (1)  2.85/ifl  for  1 
Fe/Cu(001);  and  (2)  2.83jzs  and  2.58*is  for  the  surface  and 
interface  Fe  layers,  respectively,  for  the  two  monolayer  cov¬ 
erage.  The  Fe-derived  localized  interface  states  and  the  nar¬ 
rowing  of  the  d  band  appears  to  be  the  mechanism  driving 
the  enhancement  of  the  magnetic  moments  over  the  value 
(2 . 1 2/x  0  >  in  bulk  Fe. 

Previous  experimental  work’  has  established  that  the  ex¬ 
cellent  bulk  lattice  constant  match  permits  pseudomorphic 
growth  of  fee  Fe  on  Cu(001).  Evidence  for  pseudomorphic 
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growth  iso  based  on  transmission  electron  microscopy  studies 
of  1000  A  Fe  films  grown  on  Cu(100)  films  in  high  vacuum 
(10-7  torr).  Unlike  the  Ni  on  Cu(100)  system,  which  we 
previously  studied,3  apparently,  no  structure  studies  of 
1 -5-monolayer  films  grown  on  well-characterized  substrate 
surfaces  in  10_,0-torr  vacuum  are  currently  available  for  the 
Fe  on  Cu(iOO)  system.  Therefore,  we  have  conducted  an 
extensive  investigation  of  the  growth  properties  and  struc¬ 
ture  of  thin  Fe  layers  on  Cu(100)  surfaces.  Our  LEED  and 
Auger  analysis  of  this  system  are  reported  elsewhere.10  The 
important  results  of  this  work  relevant  to  the  present  paper 
are  (1)  p(lxl)Fe  grows  on  Cu(lOO)  as  an  extension  of 
the  substrate  with  identical  (  ±0.1  A)  lattice  constant,  (2) 
thicker  films  (up  to  4  layers)  appear  to  form  excellent  epi¬ 
taxial  layers  of  fee  iron  stabilized  by  the  Cu(lOO)  substrate 
and  having  a  lattice  constant  equal  to  the  substrate,  (3)  in¬ 
terdiffusion  on  Fe  and  Cu  at  the  interface  is  not  apparent 
for  substrate  temperatures  below  250  °C,  and  (4)  the  epitax¬ 
ial  growth  appears  to  be  dominated  by  a  layer-by-layer 
mechanism  at  substrate  temperatures  of  150  °C. 

Figure  1  displays  representative  angle-resolved  photoemis¬ 
sion  spectra  for  one  and  two  monolayers  of  p(lxl)Fe  on 
Cu(100)  along  the  T -X  direction  of  the  two-dimensional 
Brillouin  zone.  All  of  our  energy  distribution  curves 


(EDC’s)  for  epitaxial  layers  were  taken  using  s -polarized 
light  with  the  A  vector  along  a  symmetry  axis  of  the  crystal 
and  with  the  emitted  electrons  detected  either  in  that  mirror 
plane  (even  symmetry)  or  perpendicular  to  it  (odd  sym¬ 
metry).  All  spectra  were  taken  at  room  temperature  (300 
K)  at  an  energy  resolution  (monochromator  plus  analyzer) 
of  approximately  100  meV.  Approximately  200  spectra  for 
one-  and  two-monolayer  Fe  films  on  CuUOO)  were  utken  in 
even  and  odd  geometry  for  kH  along  T -X  and  V -M  direc¬ 
tions  of  the  two-dimensional  Brillouin  zone.  Several  photon 
energies  were  used.  Except  for  some  minor  variations  of 
binding  energies  with  film  thickness  (discussed  below), 
spectra  corresponding  to  a  given  symmetry  and  value 
were  consistent. 

Figure  2  presents  the  two-dimensional  electronic  structure 
of  one-  and  two-layer  films  obtained  from  our  photoemis¬ 
sion  data.  Solid  lines  and  dashed  lines  in  Fig.  2  represent 
calculaled5  majority  spin  and  minority  spin  bands  for  a 
p(lxl)Fe  film  on  a  five-layer  Cu(100)  slab  which  have 
over  50%  of  their  wave  function  derived  from  Fe  basis 
functions.  These  are  the  specific  two-dimensional  energy 
bands  to  which  our  experiments  should  be  most  sensitive. 


p(IXI)  LAYER  k,.=  l23A-y 

Fe/Cu  (IOO)  r  \  / 
ODDJSYMMETRY  ■  V 
r  -X  DIRECTION;  \  »\  t  . 

h*«  21.22  eV  j  \^/J  '''' 

.  I  LAYER  ;  ft,  "  J 

-  2  LAYERS  /  A:  / 


k„  =  I  00  A" 


p  (I  X  I )  MONOLAYER  Fe/Cu(IOO) 
EVEN  SYMMETRY 


150  IOO  050 


050  IOO 


°  S  ft 


he  »  16  85  »V 


ODD  SYMMETRY 


r.v  •« 

hW-. 


k„  =0  75  A" 


k„  =0  50A- 


k„  =0  25A" 
k„  =0  0A" 


-7  -6  -5  -4  -3  -2  -I  E, 

BINDING  ENERGY  (eV) 

FIG.  1  Angle-resolved  photoemission  spectra  for  one-  and  two- 
layer  p(lx  l)Fe  films  on  Cu(lOO)  Values  of  ktl  correspond  to  the 
f  -X  direction  of  the  two-dimensional  Brillouin  zone. 
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FIG  2  Two-dimensional  electronic  structure  of  plUDFe  on 
Cu(IOO)  The  two  broad  curves  indicate  the  regions  of  binding  en¬ 
ergy  and  *ii  where  a  prominent  structure  resulting  from  the  Cu  sp 
band  is  observed  Light  solid  and  dashed  curves  represent  calculat¬ 
ed  (Ref  5)  surface  Fe  bands  having  over  50%  surface  character 
Data  are  represented  by  empty  (two-monolayer  films)  and  solid 
(one-monolayer  films)  circles  (iiv-  16  85  eV)  and  rectangles 
f  Av  — 21  II  eV) 
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The  broad  shaded  areas  in  the  even  symmetry  bands 
represent  regions  of  kn  where  prominent  features  associated 
with  the  Cu  sp  band  sweep  through  the  angle-resolved 
energy-distribution  curves  (EDC’s)  near  Ef.  These  struc¬ 
tures  were  identified  by  kn  scans  on  clean  Cu(lOO)  sur¬ 
faces;  their  presence  precludes  the  study  of  even  symmetry 
Fe  bands  in  several  regions  of  the  two-dimensional  Brillouin 
zone  at  specific  photon  energies.  Interference  of  the  sp 
band  in  studies  of  pi  1  x  1  )Ni  on  Cu(lOO)  (Ref.  3)  and  on 
Ag(lOO)  (Ref.  11)  is  less  of  a  problem  because  the  over¬ 
layer  Ni  d-band  features  are  much  narrower  than  corre¬ 
sponding  Fe  features,  and  are  therefore  more  easily  dis¬ 
tinguished  from  the  s/r-band  features. 

Some  results  of  the  theoretical  studies  are  shown  in  Fig. 
2.  It  is  clear  from  the  comparison  that  the  calculations  are 
in  reasonably  good  agreement  with  the  results  shown  in  Fig. 
2  in  the  regions  of  the  two-dimensional  Brillouin  zone  effec¬ 
tively  probed  by  our  experiments.  Based  on  the  calcula¬ 
tions,  the  most  prominent  features  observed  in  even  sym¬ 
metry  geometry  in  the  vicinity  of  EF  should  be  minority 
spin  bands.  Clearly,  spin-polarized  photoemission  studies  of 
this  system  will  be  informative.  Our  experimental  results 
around  T,  where  Fe-derived  features  are  not  masked  by  the 
Cu  sp  band,  support  the  number  of  bands  and  their  binding 
energies  predicted  by  the  calculations.  Odd  symmetry  bands 
observed  along  T -X  also  appear  to  agree  rather  well  with  the 
calculated  results,  although  there  are  significant  differences 
(0.25  eV)  between  measured  and  calculated  binding  ener¬ 
gies.  In  general,  we  find  that  the  measured  bands  lie  closer 
to  the  Fermi  energy  than  the  calculations  predict  for  the 
majority  and  minority  spin  bands.  This  result  could  imply 
that  the  magnetic  exchange  splitting  is  actually  smaller  than 
predicted  theoretically,  or  could  be  a  manifestation  of  the 
importance  of  many-body  effects  (i.e.,  core-hole  relaxation 
and  correlation).  Our  previous  experimental  studies  of  the 
bulk7  and  surface12  electronic  properties  of  ferromagnetic  Fe 
have  shown,  however,  that  correlation  effects  appear  to  be 
small;  we  therefore  assume  that  these  effects  are  also  of 
minor  importance  in  the  Fe  overlayer  system. 

One  source  of  the  discrepancy  between  the  experiments 
and  the  theoretical  results  could  be  due  to  the  fact  that  our 
measurements  were  conducted  at  300  K,  and  our  calcula¬ 
tions  correspond  to  f  =  0  K.  We  have  not  yet  attempted 
any  low-temperature  experiments  or  experiments  above 
T  =300  K  to  detect  possible  temperature  dependences  in 
the  results.  A  second  possible  source  of  disagreement 
between  our  experimental  results  and  the  calculations  are 
some  subtle  layer  dependencies  we  have  observed  in  elec¬ 
tronic  properties  of  the  Fe  layers.  Our  LEED  studies  re¬ 
vealed  two  domain  (2x1)  LEED  patterns  in  the  coverage 
range  between  0  5  and  10  monolayers.10  These  structures 
were  not  observed  above  full  monolayer  coverage,  but  their 
presence  at  low  coverage  suggests  a  competition  between 
the  substrate  stabilized  fee  structure  and  the  possible  ten¬ 
dency  for  Fe  to  form  some  other  structure,  in  particular  the 
normal  bcc  structure  of  bulk  Fe  Displacement  of  iron 
atoms  too  small  to  detect  by  our  LEED  measurements  is 
not  inconceivable,  and  such  displacements  would  cause  sig¬ 
nificant  changes  in  the  electronic  properties,  particularly  in 
the  F -M  portion  of  the  two-dimensional  Brillouin  zone  As 
noted  in  detail  elsewhere.10  while  we  have  eliminated  film 
growth  models  that  assume  the  second  layer  begins  before 
the  first  layer  is  more  than  85%  complete,  we  cannot 


discriminate  between  film  growth  in  which  islands  of  the 
second  layer  begin  to  nucleate  when  there  is  less  than  15% 
of  the  first  layer  remaining  to  be  completed. 

The  calculated  bands  for  two  layers  of  Fe  and  Cu(001) 
are  modified  from  those  of  the  monolayer  coverage.  The 
overall  Fe  surface  layer  derived  features  and  dispersions 
remain  similar  to  those  shown  plotted  in  Fig.  1.  However, 
the  localization  of  the  surface  state  wave  functions  on  the 
surface  Fe  layer  becomes  less  pronounced  due  to  the  hy¬ 
bridization  between  surface  and  subsurface  Fe  layers.  This 
change  of  bonding  also  manifests  itself  in  a  reduced  upward 
dispersion  of  odd  symmetry  bands  (dominated  by  il„  and  dr_ 
character)  along  the  V -M  direction  (cf.  Fig.  2).  Further¬ 
more,  the  calculation  predicts  the  existence  of  additional 
majority  spin  bands  having  a  strong  admixture  of  wave 
functions  localized  on  the  subsurface  Fe  layer,  in  particular, 
odd  symmetry  bands  with  binding  energy  ==2  eV  along  F- 
T,  and  an  upward  dispersion  band  along  T  -M  (not  shown  in 
Fig.  2).  Interference  from  the  Cu  sp  band  precludes  observ¬ 
ing  the  new  even  symmetry  band  for  two  monolayers,  and 
the  broad  peak  widths  associated  with  the  Fe  overlayers 
renders  it  very  difficult  to  differentiate  small  differences  in 
spectra  for  one-  or  two-monolayers.  Since  the  size  of  the 
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FIG  3  Normal  emission  photoemission  spectra  for  four-layer  fee 
Fe  films  stabilized  on  the  Cut  100)  surface  Peaks  near  E,  result 
from  the  Fe  film  and  exhibit  clear  evidence  of  direct  bulk  interband 
transitions 
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magnetic  moments  for  the  surface  Fe  layers  remains  almost 
identical  for  one-  or  two-monolayer  coverage,  the  exchange 
splitting  (A£„)  is  expected  to  be  the  same  for  both  cases 
( A£e,  =  2.65  ±0.05  eV). 

Figure  3  displays  normal  emission  spectra  for  four  layers 
of  Fe  on  Cu(lOO)  Our  LEED  analysis10  has  shown  that 
high-quality  fee  films  of  epitaxial  Fe  form  at  this  coverage. 
Features  in  the  spectra  near  £>  are  definitely  due  to  emis¬ 
sion  from  the  Fe  overlayer.  This  assignment  was  checked 
by  obtaining  corresponding  spectra  for  clean  Cu(lOO). 
Peaks  near  £>  exhibit  clear  dispersion  with  photon  energy 
tki>,  indicating  strong  influence  of  direct  bulk  transitions. 
Dipole  selection  rules  limit  the  symmetry  of  initial  states 
probed  in  normal  emission  to  A|  and  A>  symmetry.  The 
dispersion  of  the  peaks  near  £>  with  k  L  is  consistent  with 
the  calculated  bulk  bands' 1  of  fee  Fe  along  the  F -X  direction 
of  the  three-dimensional  Brillouin  zone.  It  is  therefore  pos¬ 
sible  to  carry  out  detailed  energy  band  measurements  of  the 
bulk  band  structure  of  the  fee  phase  of  ferromagnetic  iron 
stabilized  on  Cu(lOO).14 

In  summary,  our  results  have  identified  a  second  thm- 
film  magnetic  system  in  which  considerable  success  has 
been  achieved  in  the  synthesis  of  the  film,  in  its  character¬ 


ization,  and  in  obtaining  accurate  electronic  structure  infor¬ 
mation.  The  p(  \  x  1)  Fe  on  Cu(100)  appears  to  represent 
an  additional  excellent  model  system  in  which  to  explore  the 
relationship  between  magnetism  and  electronic  structure 
from  the  point  of  view  of  thin-film  magnetism  (two- 
dimensional  magnetism),  and  the  magnetism  of  a  new  artifi¬ 
cially  stabilized  bulk  phase  (fee  Fe). 
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Laser-induced  damage  and  ion  emission  of  GaAs  at  1 .06 


Austin  L.  Huang,  Michael  F.  Becker,  and  Rodger  M.  Walser 


This  study  focused  on  the  multipulse  laser  damage  and  the  subdamage  threshold  ion  emission  of  GaAs.  The 
initial  goals  were  to  determine  the  pulse-dependent  damage  threshold  and  to  correlate  ion  emission  with 
surface  damage.  A  Q-switched  Nd:YAG  laser  was  used  to  irradiate  the  ( 100  >  GaAs  samples.  Using  values  of 
■V  from  1  to  100,  we  obtained  accumulation  curves  based  on  50%  damage  probability.  Corresponding  damage 
threshold  fluences  were  0. 4-0.8  J/cm2  for  N  >  1  and  1.5  J/cm2  for  N  «  1  We  observed  large  site-to-site 
fluctuations  in  ion  emission  and  found  the  onset  of  emission  at  0  2  J/cm2  for  all  cases.  Once  surface  damage 
occurred,  ion  emission  increased  greatly.  The  observed  behavior  supports  a  surface  cleaning  model  for  the 
ion  emission  which  precedes  surface  damage  Measurements  of  linear  and  nonlinear  free  carrier  absorption 
were  made,  but  no  anomalous  absorption  was  observed. 


I.  Introduction 

The  interaction  between  laser  radiation  and  solids 
has  been  a  perplexing  problem  for  many  years.  Often 
the  lifetime  of  an  optical  device  is  determined  by  its 
susceptibility  to  optical  damage.  For  example,  it  is 
still  a  significant  problem  that  the  performance  of 
GaAs  injection  lasers  degrades  at  different  rates  with 
respect  to  power  level  thus  implying  an  accumulation 
effect.1  There  has  been  much  controversy  and  unex¬ 
plained  phenomena  associated  with  the  energy  trans¬ 
fer  mechanism  of  normal  laser  damage  as  well  as  the 
physical  nature  of  surface  damage. 

The  objectives  of  the  experiments  reported  here 
were  to  characterize  the  statistical  nature  of  surface 
damage  for  GaAs,  particularly  for  multiple  pulses  on 
one  site  (N-on-l),  to  observe  the  relationship  of 
charged  particle  emission  to  surface  damage,  and  to 
observe  the  damage  morphology  of  GaAs.  In  silicon  it 
has  been  reported  that  charged  particle  emission  is 
coincident  with  surface  damage.2  A  relation  between 
charged  particle  emission  and  surface  damage  has  yet 
to  be  reported  for  GaAs.  To  explore  the  statistical 
nature  of  surface  damage  of  GaAs,  we  performed  vari¬ 
ous  single-shot  and  N-on-l  subthreshold  laser  tests. 
We  measured  the  positive  charged  particles  emitted 
during  each  laser  pulse  to  correlate  the  charge  emission 
events  with  surface  damage.  Theoretical  calculations 
for  a  thermal  model,  as  well  as  a  plasma  production 
model,  have  been  carried  out  to  determine  the  mecha¬ 
nism  responsible  for  the  surface  damage. 
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H.  Samples  and  Apparatus 

The  GaAs  samples  used  in  the  experiment  were  sup 
plied  by  the  microwave  integrated  circuit  production 
group  at  Texas  Instruments  in  Dallas,  TX.  The  sam 
pies  had  a  (100)  orientation  and  were  very  lightly 
doped  with  chromium  (*1  X  10lfi  cm'1)  but  otherwise 
undoped  and  unannealed.  These  samples  were  char 
acterized  by  a  resistivity  of  >1  X  107  0-cm  and  an  etch 
pit  density  of  40-60  X  103  cm'2.  Only  the  front  face  of 
the  wafer  was  polished  to  optical  quality. 

To  prepare  the  sample  for  the  vacuum  tank  we  se 
quentially  cleaned  the  GaAs  wafers  in  boiling  solvents 
of  trichlorethylene,  acetone,  and  methanol  to  assure 
that  all  contaminants  had  been  removed.  While  in 
each  solvent,  the  wafer  was  ultrasonically  cleaned  be 
fore  proceeding.  After  the  final  cleaning  in  methanol, 
the  wafer  was  rinsed  in  deionized  water  (>5  min  at 
room  temperature)  and  placed  into  the  vacuum  tank 

To  measure  the  linear  and  nonlinear  absorption  in 
the  transmission  tests,  the  GaAs  wafers  were  given  an 
optical  quality  finish  on  both  sides.  The  backside  of 
the  wafer  was  mechanically  polished  in  a  two-step 
process.  Rough  polishing  with  a  grinding  pad  and  6- 
*im  diamond  ferrous  paste  was  used  to  initially  buff  the 
wafer.  The  finishing  pad  used  a  liquid  0.05-jim  alumi 
na  nonferrous  suspension  as  a  grinding  media.  After 
polishing,  the  wafer  was  chemically  cleaned  as  de 
scribed  above. 

The  experimental  system  is  shown  schematically  in 
Fig.  1.  It  utilized  a  Q-switched  Nd:YAG  laser  with  a 
full  width  at  half-maximum  (FWHM)  pulse  length  of 
45  ns,  TEMix)  transverse  mode,  and  wavelength  of 

I. 064  >im.  Although  the  pulse  envelope  was  Gaussain. 
the  laser  was  not  single  longitudinal  mode.  A  knife 
edge  scanning  technique  was  used  to  measure  the  fo¬ 
cused  spot  diameter  on  the  sample  surface,  which  was 
found  to  be  580  ^m.  The  laser  energy  fluctuated  from 
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Fig  1.  Experimental  setup 


pulse  to  pulse  resulting  in  a  2-5%  standard  deviation  in 
energy  for  each  data  run.  The  laser  was  operated  at  a 
15- Hz  repetition  rate,  and  the  incident  energy  was 
attenuated  by  rotating  a  halfwave  plate  in  front  of  a 
polarizer. 

A  computer  was  utilized  to  control  and  count  the 
laser  pulses  striking  the  GaAs  sample  and  to  record  the 
charge  collected.  The  computer  had  a  Z80  micro¬ 
processor  and  supported  several  data  acquisition  ports 
via  sample  and  hold  amplifiers  and  a  multichannel  A- 
D  converter.  The  delay  for  the  sample  and  hold  am¬ 
plifiers  was  set  to  130  ps  in  hardware.  This  assured 
that  the  integrated  values  read  for  the  charge  emission 
were  near  the  peak  value  but  were  collected  well  after 
the  laser  noise  burst.  The  electronics  were  triggered 
by  a  vacuum  photodiode.  A  shutter,  controlled  by  one 
of  the  computer’s  digital  output  ports,  was  used  to 
select  pulses  for  sample  irradiation.  To  allow  time  for 
data  computation  and  storage  betwee  pulses,  the  shut¬ 
ter  selected  every  third  pulse  from  the  laser  train. 

A  760-mm  focal  length  lens  focused  the  beam  onto 
the  sample.  The  long  focal  length  gave  a  region  of 
constant  spot  size  ~3  mm  deep  at  the  focal  plane  and 
permitted  greater  error  in  the  placement  of  the  sample. 
A  system  of  90°  prisms  was  placed  after  the  lens  to  scan 
the  beam  across  the  sample  in  the  vacuum  tank. 

To  detect  charged  particle  emission  the  sample  and 
detection  device  were  placed  into  the  vacuum  environ¬ 
ment.  The  system  was  dry  pumped  to  achieve  experi¬ 
mental  pressures  of  <1  X  10-6  Torr.  A  Hamamatsu 
electron  multiplier  tube  model  R596  served  as  the  ion 
detector  and  was  placed  inside  the  vacuum  system  at  a 
36°  angle  to  the  laser  beam  path.  The  sample  surface 
was  normal  to  the  optical  path.  The  output  of  the 
electron  multiplier  was  connected  to  a  capacitive  volt¬ 
age  divider  and  amplifier.  The  dynamic  range  of  the 
data  acquisition  system  was  increased  by  making  one 
channel  100X  less  sensitive  than  the  other.  The  elec¬ 
tron  multiplier  current  gain  was  estimated  to  be  ~0.75 


,i, .% 


X  10®  at  a  voltage  of  —2  kV  at  the  first  dynode.  A  data 
value  of  1000  in  the  more  sensitive  A-D  converter 
channel  corresponded  to  a  charge  of  ~1.5  X  10' 17  C. 
This  —  2-kV  bias  potential  not  only  determined  the 
gain  of  the  electron  multiplier,  but  it  created  an  attrac¬ 
tive  potential  sufficient  to  collect  all  emitted  positive 
ions. 

M.  Experimental  Data 

Three  experiments  were  conducted  on  the  cleaned 
samples:  ( 1 )  an  emission  scan  of  the  GaAs  sample  at  a 
fixed  laser  fluence;  (2)  an  N-on-1  emission/damage 
threshold  test;  and  (3)  a  transmission  test.  All  the 
tests  were  performed  in  vacuum  with  the  exception  of 
the  transmission  test.  The  calibration  and  beam  pro¬ 
files  were  checked  at  the  start  and  end  of  each  experi¬ 
mental  session  to  assure  accurate  beam  fluence  mea¬ 
surements. 

The  emission  scan  of  the  GaAs  sample  checked  the 
uniformity  of  charge  emission  under  constant  laser 
fluence.  The  scan  spots  were  separated  by  0.8  mm, 
and  forty-two  samples  were  taken.  The  sample  was 
irradiated  with  a  constant  fluence  of  0.63  J/cm2  (±  7%). 
The  results  in  Fig.  2  show  a  bilevel  emission  contour. 
A  different  pattern  of  emission  variations  was  ob¬ 
served  on  each  sample.  Site-to-site  fluctuations  of  the 
defect  density  in  GaAs  have  been  previously  noted3 
and  could  disrupt  the  statistics  of  charged  particle 
emission  and  the  emission  threshold.  To  minimize 
the  effects  of  these  site-to-site  variations,  we  conduct¬ 
ed  the  emission/damage  tests  over  a  small  area  of  the 
wafer. 

Other  experiments  were  concerned  with  the  multi¬ 
pulse  and  single-pulse  irradiance  of  GaAs.  In  the  mul¬ 
tipulse  experiments,  fluences  below  the  single-pulse 
damage  threshold  were  used  to  search  for  accumula¬ 
tion  effects  associated  with  either  emission  or  damage. 
The  objectives  of  these  experiments  were  (1)  to  identi¬ 
fy  accumulation  effects  associated  with  surface  dam- 
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Fig.  3.  Damage  and  emission  probabilities  vs  fluence  for  N  =  1. 
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age,  (2)  to  measure  a  damage  threshold  based  on  50% 
probability  statistics,  (3)  to  search  for  correlations  be¬ 
tween  surface  damage  and  charged  particle  emission, 
and  (4)  to  determine  the  damage  morphology  as  a 
function  of  fluence  and  number  of  pulses. 

For  each  value  of  the  number  of  pulses  N  used  {N  = 
1, 3, 10, 30,  and  100)  we  irradiated  150  to  200  different 
sites  to  obtain  the  statistics  of  surface  damage.  Sur¬ 
face  damage  was  identified  by  searching  for  surface 
changes  with  a  Normarski  optical  microscope  at  200  X. 
The  damage  data  for  N  =  1, 10,  and  100  are  plotted  in 
Figs.  3-5,  respectively.  In  each  plot  we  used  a  linear 
curve  fitting  program  to  obtain  the  damage  probabili¬ 
ty.  From  the  linear  fit  we  obtained  the  50%  probabili¬ 
ty  fluence  for  each  value  of  N  and  used  this  value  in  the 
accumulation  plot  in  Fig.  6. 

For  all  values  of  N,  we  observed  the  onset  of  charged 
particle  emission  at  an  average  fluence  of  0.2  J/cm2 
(±0.07  J/cm2).  The  values  for  the  onset  of  emission 
and  the  50%  damage  threshold  are  given  in  Table  I. 
From  these  damage  fluence  values  and  their  plot  in 
Fig.  6,  it  is  apparent  that  an  accumulation  effect  was 
present  in  that  the  threshold  decreased  after  the  first 


Fig.  5.  Damage  and  emission  probabilities  vs  fluence  for  N  =  100. 
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Fig.  6.  Number  of  pulses  required  to  reach  50^  damage  probability 
vs  fluence.  The  line  was  hand  drawn  to  aid  the  viewer 
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TaM*  I.  On**t  of  Charge  Emission  and  50  %  Damaga  Threshold  Fkisncs 
for  Af-on- 1  Exparfmants 
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pulse.  As  the  number  of  laser  pulses  incident  on  the 
wafer  increased,  the  50%  damage  fluence  decreased 
and  then  leveled  off.  It  is  not  clear  from  this  scattered 
data  whether  the  decrease  continues  slowly  for  N  >  1 
or  the  curve  is  level.  Scatter  in  the  threshold  data  and 
the  related  observation  of  low  slopes  in  the  damage 
probability  vs  fluence  curves  have  been  associated 
with  the  presence  of  the  local  defects.4  This  possibili¬ 
ty  will  be  correlated  with  damage  morphology  in  a  later 
section.  Similar  curv  es  to  the  one  shown  in  Fig.  6,  but 
showing  a  monotonic  decrease  in  threshold,  have  been 
reported  for  metals,5  polymethylmethacrylate,  and 
modified  polymethylmethacrylate.8  The  single-shot 
50%  damage  fluence  for  GaAs  is  in  agreement  with 
values  previously  measured." 8 

The  positive  charged  particle  emission  data  were 
collected  on  a  pulse-by-pulse  basis.  A  typical  profile 
for  N  =  30  is  shown  in  Fig.  7.  The  site  from  which  this 
emission  profile  was  obtained  was  damaged.  From 
these  data  we  note  that  the  first  few  pulses  (from  two 
up  to  ten  depending  on  the  case)  induced  a  charge 
emission  that  decreased  as  the  pulse  number  in¬ 
creased.  After  this,  the  emission  increased  greatly. 
Since  the  microscopy  and  determination  of  damage 
were  done  after  termination  of  the  experiment,  we 
were  unable  to  fix  accurately  the  pulse  number  at 
which  damage  was  initiated.  Visual  observation  of  the 
initial  flash  was  a  somewhat  insensitive  measure  of 
damage  initiation.  We  interpret  the  decrease  in  the 
first  few  pulses  as  a  surface  cleaning  effect.  These  first 
pulses  cleaned  any  residue  either  left  from  the  chemi¬ 
cal  cleaning  process  or  ejected  from  previously  laser 
irradiated  sites.  This  emission  could  also  be  due  in 
part  to  the  depletion  of  the  more  volatile  specie,  As, 
from  the  surface  atomic  layers.  Figure  7  also  shows 
that,  once  the  site  experiences  surface  damage,  charge 
emission  is  greatly  increased.  The  emission  at  non- 
damaged  sites  always  decreased  with  increasing  num¬ 
ber  of  pulses. 

Figures  8  and  9  are  SEM  micrographs  showing  the 
development  of  laser-induced  surface  damage  mor¬ 
phologies.  The  initial  surface  was  featureless  and 
without  contrast.  In  the  Normarski  microscope,  the 
initial  surface  change  appeared  as  a  depression  with  an 
area  equal  to  that  of  the  laser  spot.  Within  this  area, 
there  were  several  very  small  pitted  regions  (similar  to 
those  shown  in  Fig.  8  but  not  as  well  defined).  As  the 
fluence  increased,  or  the  number  of  pulses  increased, 
the  initial  surface  damage  evolved  into  the  melt  pits 
shown  in  Fig.  8.  Similar  damage  pits  have  been  seen  in 
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dN/dt  =  aj/hf  +  Sj'l-lhi  -  NH,  . 

dl/dz  =  -  dj1  -  oNl , 


(2) 


Intensity  (MW/cm^) 

Kifi.  10  Inverse  transmission  vs  intensity  for  a  GaAs  wafer  polished 
on  both  sides.  The  solid  line  represents  a  least -squares  linear  fit  to 
the  data. 
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GaAs  at  11)64  pm.)  :s  The  damage  evolution  ends 
with  severe  cratering  as  shown  in  Fig.  9.  The  theory  of 
Fauchet  and  Siegman51  would  seem  to  apply  in  this 
case.  They  postulate  that  laser  initiated  ripple  pat¬ 
terns  on  the  surface  of  silicon  and  GaAs  are  formed 
from  the  interaction  of  the  incident  laser  wave  front 
with  scattered  optical  surface  waves.  At  1.064  fim, 
however,  with  a  pulse  length  of  45  ns  and  spot  diameter 
of  580  pm.  no  ripple  patterns  were  seen  in  our  experi¬ 
ments.  Accordingly,  the  longer  pulse  lengths  may  in¬ 
hibit  ripple  formation. 

IV.  Discussion  and  Models 

Thermal  models  have  been  used  in  the  past  to  ex¬ 
plain  both  laser  annealing  processes  and  laser  damage 
phenomena.  These  models  assumed  uniform  heating, 
and  the  possibility  of  inhomogeneities  was  not  includ¬ 
ed.  The  necessity  of  inhomogeneous  processes  is 
made  apparent  by  examining  the  laser  absorption  in 
the  sample.  From  the  slope  and  intercept  of  the  linear 
fit  of  the  inverse  transmission  vs  intensity  shown  in 
Fig.  10,  the  linear  and  nonlinear  absorption  behavior 
may  be  determined.  Although  small  values  of  linear 
absorption  coefficient  a„  are  not  measured  accurately 
by  this  method,  a  value  for  n0  is  a  by-product  of  the 
curve  fitting  procedure.  The  linear  absorption  coeffi¬ 
cient  was  found  using  the  equation 

7',,  =  ( 1  -  R lJ  expl-u,^.) .  (1) 

where  T„  is  the  zero  fluence  transmission  intercept,  R 
is  the  reflectivity  at  the  air-gallium  arsenide  wafer 
interface,  and  L  is  the  thickness  of  the  GaAs  sample. 
In  our  case  To  was  0.45  (from  Fig.  10),  R  was  calculated 
to  be  0.306  based  on  tabulated  refractive-index  data, 
and  L  =  0.0635  cm.  Substituting  these  values  into  Eq. 
(1)  and  solving  yield  «()  =  1.2  cm-1  (±40%).  The  free 
carrier  or  nonlinear  absorption  can  be  determined 
from  a  linearized  equation  for  inverse  transmission  vs 
intensity  as  shown  in  the  following  analysis.  The  cou¬ 
pled  differential  equations  describing  the  carrier  den¬ 
sity  and  optical  intensity  are 
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where  N  =  N(x,y,z,t)  is  the  time-  and  space-dependent 
carrier  density,  I  =  I(x,y,z,t)  is  the  time-  and  space- 
dependent  optical  intensity,  hf  is  the  photon  energy,  tc 
is  the  free  carrier  lifetime,  do  is  the  two-photon  absorp¬ 
tion  coefficient,  and  a  is  the  free  carrier  optical  cross 
section.  In  Eq.  (2),  carrier  relaxation  due  to  Auger 
recombination  has  been  neglected  due  to  the  low  ex¬ 
pected  carrier  densities.  At  10  MW/cm2,  for  example, 
two-photon  absorption  is  negligible  with  respect  to  one 
photon  and  free  carrier  absorption;  (3qI  =  0.23  cm-1 
and  do  =  0.023  cm/MW.10  For  this  reason,  two-photon 
absorption  will  be  neglected  in  both  Eqs.  (2)  and  (3). 
Also  in  Eq.  (2)  we  have  assumed  that  the  one-photon 
absorption  is  impurity  dominated  and  that  only  one 
charge  carrier  is  generated  per  absorbed  photon. 

The  solution  to  these  equations  will  proceed  much  as 
described  by  Boggess  et  al. 11  except  that  in  our  case  the 
optical  pulse  length  is  much  longer  than  the  free  carrier 
lifetime,  and  the  solution  to  Eq.  (2)  may  be  taken  to  be 
quasi-steady  state.  The  solution  to  Eq.  (2)  for  instan¬ 
taneous  carrier  density  is  given  by 


■V  =  t,„J/hf. 


(41 


This  allows  Eqs.  (3)  and  (4)  to  be  combined  and  solved. 
That  is,  Eq.  (3)  must  be  integrated  over  z  as  well  as  x,y, 
and  t  to  obtain  a  solution  in  terms  of  toal  transmitted 
energy,  which  is  an  experimentally  observable  parame¬ 
ter.  These  integrations  may  be  performed  as  shown  in 
Ref.  11  for  Gaussian  temporal  and  spatial  profiles  if 
the  free  carrier  absorption  is  not  excessively  large. 
This  approximation  is  possible  if  the  intensity  is  less 
than  a  critical  intensity  for  free  carrier  absorption  as 
given  by 

,  ,  hf  1 

I  <  I  =  „  ,  ,  •  i'll 

(1  -  R I  [I  -  exp(-n„/.l| 


For  the  paramaters  in  our  experiments,  /<  =  17.3  MW/ 
cm2.  The  solution  to  Eq.  (3)  for  this  case  is  conven 
tionally  written  in  terms  of  the  inverse  total  energy 
transmission  T,  which  linearizes  the  relationship 

7' _1  =  [II  -  R)~tx p(-, 


ini  l|l  -  exp(-n„/.l| 
hft  1  -  W)[expl -«„/-•! 


(ill 


where  E  is  the  total  transmitted  laser  energy,  tr  is  the 
optical  pulse  width  (FWHM),  and  wq  is  the  laser  spot 
radius  (1/e2  intensity).  The  term  on  the  right  in 
brackets  may  be  thought  of  as  an  effective  intensity 
and  is  the  quantity  plotted  on  the  abscissa  in  Fig.  10. 

A  linear  fit  to  the  data  in  Fig.  10  gives  a  slope  of  40.2 
and  a  corresponding  value  of  (<rt,  )  =  2.0  X  10~2S  s  cm2. 
This  product  of  two  experimentally  measurable  quan¬ 
tities  may  be  checked  against  published  data,  but  un¬ 
fortunately  there  is  uncertainty  and  dispersion  in  the 
published  values.  For  a,  values  ranging  from  5  X  10  -’" 
(Ref.  12)  to  5  X  10'17  cm2  (Ref.  6)  extrapolated  to  1.06 
Mm  have  been  reported.  For  f, ,  one  direct  measure- 


ment  gave  32  ns,1 1  but  this  value  is  known  to  be  strong¬ 
ly  dependent  on  impurities  and  defects  which  increase 
the  recombination  rate.  If  we  use  a  value  of  (,  =  3.2  X 
10"  -  s  (one  of  the  largest  values  reported,  presumably 
for  relatively  pure  material),  our  experiments  predict  a 
free  carrier  cross  section  of  6.3  X  10' **  cm-,  which  is 
well  within  the  range  of  reported  values  given  above 
even  if  the  free  carrier  lifetime  were  a  factor  of  10 
smaller  to  account  for  a  possibly  larger  deep  level  im¬ 
purity  density.  In  addition,  the  functional  depen¬ 
dence  of  T"1  on  intensity  agrees  with  the  theoretically 
predicted  form  as  shown  in  Fig.  10.  From  this,  we 
conclude  that  we  have  included  the  major  homoge¬ 
neous  absorption  mechanisms  in  our  calculations  and 
that  no  gross  anomalous  absorption  was  involved. 

To  determine  the  mechanism  responsible  for  the 
laser-induced  damage,  we  examine  the  temperature 
change  of  the  surface.  Both  linear  and  free  carrier 
absorption  must  be  included  in  the  heating  model. 
Assuming  that  all  the  absorbed  energy  is  thermalized 
without  thermal  or  carrier  diffusion,  the  peak  surface 
temperature  rise  can  be  written 

AT  =  (d//dx),utfrp,/(>  .  (7) 

where  t;,  is  the  pulse  length,  CL  is  the  heat  capacity,  and 
p  is  the  mass  density.  In  this  approximate  calculation 
we  assume  a  rectangular  pulse  shape.  Substituting 
Eqs.  (3)  and  (4)  into  Eq.  (7)  and  taking  the  surface 
reflectivity  into  account  give 

AT  =  [rt0  -Ml  -  «><Tfrrt0/0//./ltl  -  R)I0tp/C,p  ,  (81 

where  /„  is  the  incident  intensity.  For  GaAs  at  room 
temperature,  we  have  Cv  =  0.327  J/g  K  and  p  -  5.32  g/ 
cm’.  Using  a  typical  damage  intensity  for  the  single¬ 
shot  case,  IQ  =  19  MW/cm2  and  the  measured  value  for 
(<rtc),  we  obtain  a  maximum  temperature  rise  of  6.1°C. 
This  diffusionltss  model  assumes  homogeneous  ener¬ 
gy  absorption  and  temperature-independent  reflec¬ 
tion  and  linear  and  nonlinear  absorption  coefficients. 

From  the  results  of  the  previous  calculation  we  con¬ 
clude  that  uniform  heating  is  not  sufficient  to  explain 
the  melting  effects  observed  on  the  GaAs  surface.  A 
similar  conclusion  has  been  reported  for  GaAs  by  Gra- 
syuk  and  Zubarev14  and  for  silicon  by  Merkle  et  al ,1S 
and  Becker  et  al.2  However,  this  does  not  exclude  the 
possibility  of  an  abnormal  absorption  or  heteroge¬ 
neous  absorption  in  defects  at  or  near  the  GaAs  sur¬ 
face. 

An  alternative  process  that  might  lead  to  damage  is 
lattice  disruption  caused  by  a  high-density  carrier 
plasma  produced  during  the  laser-solid  interaction. 
The  possible  relation  of  this  process  to  silicon  anneal¬ 
ing  has  been  reported  previously.16  1 7  To  investigate 
this  possibility,  we  calculated  the  peak  number  of  car¬ 
riers  produced  by  the  impurity  absorption. 

The  peak  carrier  density  produced  is  calculated 
from  Eq.  (4)  where  ambipolar  diffusion  was  neglected 
because  of  the  short  pulse  length.  To  obtain  an  upper 
bound  for  the  carrier  density,  we  used  the  incident 
intensity  of  a  typical  1 -on- 1  damage  pulse,  /  =  19  MW/ 
cm'.  Accounting  for  the  reflectivity,  we  obtained  a 


peak  carrier  density  of  V  =  1.7  X  101"  cm  .  For 
comparison,  the  compensated  intrinsic  carrier  concen¬ 
trations  were  calculated.  Accounting  for  the  chromi¬ 
um  dopant  concentration  (^1  X  1016  cm  3  and  the 
resistivity  of  the  sample  ( >  1  X  10"  cm),  we  calculated 
an  electron  concentration  of  7.35  X  10"  cm"  'and  a  hole 
concentration  of  1.56  X  lO1*  cm"1.  Mobilities  of  5000 
and  300  cm-A' s  were  used  for  the  electrons  and  holes, 
respectively.  Although  the  photogenerated  carrier 
density  is  much  greater  than  the  compensated  intrin¬ 
sic  density,  it  is  still  quite  small  compared  to  the  densi¬ 
ties  where  plasma  effects  become  important,  1011J-10J|' 
cm“  !.  Therefore,  we  can  disregard  the  plasma  model 
from  further  consideration  as  a  primary  damage  mech¬ 
anism  in  GaAs. 

The  charged  particle  detector  system  monitored  the 
pulse-by-pulse  charge  emitted  from  the  GaAs  sample. 
The  magnitude  of  the  emitted  charge  has  been  plotted 
Fig.  7  on  a  pulse-by-pulse  basis  for  a  site  which  exhibit¬ 
ed  surface  damage.  By  examining  the  emission  pro¬ 
files,  the  emitted  charge  was  observed  to  decrease  after 
the  first  few  pulses  and  later  increase  at  least  2  orders 
of  magnitude  when  damage  occurred.  The  relatively 
small  decreasing  emission  profile  of  the  initial  pulses 
appears  to  be  a  surface  cleaning  effect.  The  later 
increase  of  charge  emission  after  the  fifth  pulse  in  this 
case  is  attributable  to  surface  damage.  For  GaAs,  only 
two  types  of  charge  emission  profile  were  observed: 
either  a  cleaning  effect  without  damage  or  a  cleaning 
effect  with  damage.  Damage  in  the  absence  of  a  clean  - 
ing  effect  was  never  observed,  although  at  very  high 
intensities  the  two  profiles  merged  and  became  indis¬ 
tinguishable.  This  differs  greatly  from  the  emission 
characteristics  of  silicon  as  reported  in  Ref.  2.  For 
silicon,  emission  of  charged  particles  coincided  with 
surface  damage  initiation,  and  no  cleaning  emission 
was  observed.  Furthermore,  the  emission  of  charge 
prior  to  damage  in  GaAs  did  not  affect  the  observed 
damage  behavior  in  any  detectable  way. 

It  appears  that  neither  the  uniform  heating  model 
nor  the  carrier  pair  production  model  can  adequately 
describe  the  damage  mechanism  of  GaAs.  In  addition, 
it  does  not  seem  possible  to  relate  the  nature  of  the 
damage  to  charge  emission  due  to  surface  cleaning, 
although  the  latter  appears  to  be  a  necessary,  and 
possibly  essentia),  precuisor  to  damage.  From  these 
results  we  are  led  to  speculate  on  other  possible  influ¬ 
ences  that  may  cause,  or  enhance,  surface  damage. 
Lattice  defects  near  the  surface  of  the  GaAs  wafer  are 
possible  energy  absorption  sites  for  nucleating  dam¬ 
age.  These  lattice  defects  include  anomalous  vacan¬ 
cies,  interstitials,  and  dislocations  introduced  during 
the  crystal  growth  or  surface  pi  eparation  processes.  A 
single-point  defect  seems  an  improbable  physical  area 
for  absorbing  sufficient  energy  to  cause  a  melt  spot.  It 
is  more  probable  that  a  cluster  of  point  defects  will  act 
as  an  efficient  absorption  site.  This  would  result  in  a 
surface  damage  morphology  of  random  melt  points 
within  the  beam  diameter  of  the  laser. 

Other  possible  sites  for  the  nucleation  of  damage  are 
physical  surface  defects  caused  by  processing  or  physi- 
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cal  handling  of  the  wafer  that  would  scratch  the  sur¬ 
face.  Surface  defects  on  an  optical  surface  will  in¬ 
crease  absorption  and  scattering,  possibly  creating 
absorption  nuclei  which  could  lead  to  damage.  The 
potential  for  nucleation  at  physical  defects  have  been 
previously  speculated  by  Smith.8  In  our  experiments, 
microscopic  inspection  before  damage  has  eliminated 
the  possibility  of  gross  surface  defects  (>1  ^m)  as  a 
cause  of  damage.  One  would  not  have  expected  these 
types  of  defect  in  integrated  circuit  quality  material. 

V.  Conclusion  and  Comments 

We  observed  accumulation  effects  in  GaAs  with 
multiple-pulse  laser  irradiance  below  the  1-on-l  dam¬ 
age  threshold  fluence.  These  experiments  also  yield¬ 
ed  various  N-on-1  laser  damage  thresholds  derived 
from  the  statistics  of  the  surface  damage.  In  the  ex¬ 
periments  we  also  measured  the  linear  and  nonlinear 
optical  absorption  constants  for  the  GaAs  sample. 
With  this  information,  calculations  were  performed  to 
determine  the  mechanism  responsible  for  the  surface 
damage.  Results  for  the  uniform  heating  and  carrier 
production  models  clearly  show  that  these  mecha¬ 
nisms  could  not  be  responsible  for  damage.  The  possi¬ 
bility  remains  that  nonuniform  localized  carrier  gener¬ 
ation  and  heating  may  be  involved  in  the  damage 
process.  An  inhomogeneous  heating  model  is  sup¬ 
ported  by  the  observed  damage  morphologies.  It  is 
also  possible  that  the  sample  contained  absorbing  de¬ 
fects,  or  defect  clusters,  that  could  grow  and  contribute 
to  the  damage  process  or  surface  defects  (e.g.,  pits  and 
scratches)  that  might  cause  a  melt  spot  to  nucleate. 
Although  in  our  experiments  the  former  is  believed  to 
be  more  likely. 

By  investigating  the  pulse -by -pulse  history  of  the 
charged  particle  emission,  a  cleaning-type  profile  was 
discovered.  This  cleaning  emission  proved  to  have  no 
observable  influence  on  the  subsequent  damage.  Ap¬ 
parently,  adsorbates  remaining  on  the  wafer  surface 
after  cleaning  did  not  act  as  absorbing  sites  to  cause 
surface  damage.  The  charge  emission  proved  to  be 
very  noisy  from  site  to  site.  Although  the  emission  did 
not  strongly  correlate  with  the  event  of  surface  dam¬ 
age,  once  surface  damage  commenced  an  increase  in 
charge  emission  was  observed.  This  behavior  is  con¬ 
sistent  with  the  idea  of  a  hot  partially  ionized  vapor 
being  released  from  melt  pits  as  damage  proceeds. 

Further  investigation  is  needed  to  understand  the 
mechanism  of  energy  transfer  that  causes  damage  and 
emission.  The  relationship  of  the  accumulation  effect 
to  lattice  defects  could  prove  to  be  a  fundamental  key 
in  understanding  the  damage  process.  Another  pa¬ 
rameter  which  might  influence  the  damage  threshold 
is  the  dopant  level  of  the  semiconductor  material. 
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Abstract — We  discuss  the  results  of  a  new  Iw  o-dimensional  (2-0) 
finile-element  model  for  GaAs  MESFET’x  made  by  ion  implantation. 
Several  different  devices  are  characterised  by  varying  gate  recess  and 
doping  profile.  The  simulation,  in  qualitative  agreement  with  experimen¬ 
tal  findings,  shows  that  a  EET  with  a  shallow  gate  recess  exhibits  a  similar 
behavior  to  a  EET  with  a  deep  implantation,  i.e.,  an  improvement  in 
linearity,  a  higher  pinch-off  voltage,  and  a  decrease  in  (ransconductance. 

!.  Introduction 

THE  PERFORMANCE  of  ion-implanted  GaAs  MESFET’s 
is  usually  predicted  [  1  ],  [2]  using  an  analytical  model  such 
as  the  "two  region  model"  (3|  To  include  diffusion  current 
terms  and  transverse  current  terms  more  adequately,  we  have 
adopted  a  two-dimensional  finite-element  method  (2-D  FEM) 
to  numerically  simulate  GaAs  MESFET's  with  nonuniform 
doping  profiles  and  various  gate  recesses.  In  particular,  we 
have  studied  a  model  ion-implanted  structure,  varying  both  the 
depth  of  the  implant  relative  to  the  source-drain  surface  and 
the  depth  of  the  gate  recess.  For  these  numerical  simulations 
the  FEM  has  several  advantages  over  the  finite-difference 
method,  such  as  the  flexibility  of  the  mesh  size  used  in  the 
calculation  and  the  inclusion  of  current  conservation  without 
the  need  of  phantom  nodes  at  insulatory  boundaries  where 
default  Neumann  boundary  conditions  are  applied. 

II.  Simulation 

Using  a  finite-element  algorithm  from  a  previous  work  [4|, 
the  coupled  Poisson  equation  and  current  continuity  equations 
in  the  device  arc  solved  to  find  the  unknown  potentials  and 
electron  concentrations  under  various  bias  conditions  using  a 
standard  FEM  formulation  (5).  Once  the  electric  field  is 
calculated  by  taking  an  average  for  all  Gaussian  point  values  in 
an  element,  an  effective  mobility  is  obtained  using  the 
assumed  velocity  versus  electric-field  relationships  [6].  Using 
the  calculated  electric  field  and  velocity,  a  temperature  is 
found  from  the  energy  transport  equation  [7]  neglecting  the 
time  dependence  of  energy  for  a  first-order  approximation. 
With  the  effective  mobility  and  temperature,  the  diffusion 
coefficient  is  calculated  from  Einstein's  relation. 

The  time  step  used  in  the  simulation  is  fixed  at  0.01  ps,  i.e  , 
the  dielectric  relaxation  time  for  material  doped  to  a  level 
similar  to  the  peak  of  the  doping  profile  From  these,  I-V 
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characteristics  and  equivalent  circuit  parameters  have  been 
calculated.  Unlike  the  triangular  elements  preferred  in  many 
semiconductor  simulations,  the  bilinear  rectangular  elements 
adopted  here  make  it  easier  to  manipulate  the  input  and  output 
data  by  block  mesh  generation.  The  boundary  conditions  are 
applied  easily  by  the  so-called  penalty  method  [8],  It  is  found 
that  satisfactory  results  are  obtained  with  less  than  500  nodes. 
For  a  typical  node  number  of  307,  it  takes  less  than  2  s  of 
execution  time  for  each  iteration  on  a  CDC  Dual  Cyber  170/ 
750  at  the  University  of  Texas  at  Austin. 

III.  Device  Structure  and  Approximations 

The  basic  device  structure  used  in  this  simulation  is  shown 
in  Fig.  1.  We  have  assumed  a  finite  active-layer  depth, 
neglecting  carrier  transport  far  from  the  implant  peak.  The 
deep  region  is  considered  to  be  semi-insulating.  This  assump¬ 
tion  is  based  on  the  reported  observations  that  phenomena  such 
as  deep  traps  (2).  [9],  1 10]  contribute  to  mobility  degradation 
in  this  deep  active  region.  For  all  the  simulations  we  have  also 
assumed  a  Gaussian-like  active-layer  doping  profile,  with  a 
constant  peak  carrier  concentration  of  2.5  x  10 17  cm  3.  First, 
the  effects  of  gate  recess  changes  were  studied  by  holding  the 
depth  of  the  doping  profile  constant  (as  shown  in  Fig.  1(b)), 
with  the  peak  located  at  85  nm  from  the  source-drain  surface. 
Next,  to  study  the  effects  of  implant  depth,  the  gate  recess  was 
set  to  zero  and  the  surface  location  of  the  electrodes  was  varied 
instead,  which  results  in  a  change  in  the  gate-to-profile  peak 
distance.  Here  the  implanted  doping  profile  is  left  unchanged. 

The  doping  concentration  below  the  source  and  drain 
contacts  is  fixed  at  2  x  10|7  cm  \  and  below  the  Schottky 
gate  contact  is  set  to  zero  to  approximate  the  built-in  depletion 
region.  The  built-in  gate  voltage  is  assumed  to  be  0.8  V.  It  is 
assumed  that  the  low  field  mobility  is  3200  cnr/V-s  (appro¬ 
priate  for  electrons  near  the  peak  of  the  doping  profile),  the 
saturation  velocity  is  107  cm/s,  and  the  threshold  electric  field 
is  4  kV/cm.  In  order  to  simplify  the  model  we  have  taken  these 
parameters  to  be  independent  of  doping. 

IV.  Results  and  Discussion 

The  /-  V  characteristics  obtained  by  varying  the  gate  recess 
depth  are  shown  in  Fig.  2.  The  data  for  a  gate  depth  of  55  nm 
arc  comparable  to  previous  experimental  results  (1 1).  As  the 
gate  depth  increases,  the  transconductancc  becomes  larger,  the 
magnitude  of  pinch-off  voltage  becomes  smaller,  and  the 
linearity  of  (ransconductance  variation  becomes  poorer.  Simi¬ 
lar  phenomena  are  observed  for  the  various  implantation  cases 
shown  in  Fig.  3.  Here  the  /-  V curves  behace  in  a  fashion  quite 


0741  3I06/86/0400-0208S01.00  1986  IEEE 


SONG  el  al..  MODELING  OF  ION-IMPLANTED  GaAs  MESFETs 


209 


Lgs  Lg  Lgd 


0  50  100  ISO  200 

DEPTH(nm) 

(b) 


Fig  I.  Two-dimensional  GaAs  MESFET  model  (a)  Dimensions  of  device 
simulated  Dram  and  source  length:  0.2  |im.  gale  length  L,  0.5  um-  gate 
to-source  distance  L„  1  *im;  gale-to-drain  distance  l.td.  1.5  ^m;  gate 
width:  300  /on.  (b)  Carrier  profile  used  in  simulation.  All  the  carrier 
profiles  used  have  a  peak  concentration  of  2.J  x  10'Vcm'.  For  Fig.  2,  the 
source  and  drain  surface  is  located  at  0  nm.  and  for  Fig.  3  the  source,  drain, 
and  gate  surfaces  are  all  located  at  30.  55.  and  70  nm  for  project  ranges  55. 
30.  and  15  nm,  respectively 
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Fig.  2.  /-  V characteristics  simulated  by  varying  gate  depth  in  recessed  gate 

structure 
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Fig.  3.  l-V  characteristics  simulated  by  varying  carrier  profile  in  nonre- 
cessed  gate  structure. 
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Fig.  4  Plot  of  drain-current  variation  versus  gate-to-source  voltage  for  Figs. 

2  (recessed)  and  3  (planar). 

similar  to  the  I-  V  curves  for  a  gate  recess  with  identical  gate- 
to-profile  peak  distances.  These  phenomena  are  explicitly 
shown  in  Fig.  4  for  one  typical  drain  voltage.  Here,  the 
variation  of  the  slope  is  related  to  the  linearity.  Such  results  as 
an  improvement  in  linearity  with  a  corresponding  decrease  in 
intermodulation  distortion  for  deeper  implants  (i.e.,  larger 
gate-to-peak  distance)  have  been  reported  previously  ( 12)- 
[15].  This  particular  characteristic  is  important  in  large-signal 
applications,  and  therefore  a  trade-off  between  transconduct¬ 
ance  and  linearity  may  be  required  in  these  applications. 

The  effects  of  changes  in  the  tail  of  the  active-layer  doping 
profile  have  also  been  investigated.  Current-voltage  curves 
for  devices  with  the  same  profile  shown  in  Fig.  1(b)  but  with 
an  additional  tail  10-20  nm  long  (extending  the  active-layer 
doping  concentration  down  to  3  x  10 15  cm  ’  from  the 
original  1  x  1016)  have  been  calculated.  It  is  found  that  the 
pinch-off  voltage  increases  somewhat,  while  the  transconduct¬ 
ance  decreases  near  pinch-off.  These  changes  are  thought  to  be 
related  to  the  steepness  of  the  doping  profile  near  the  substrate 
interface.  The  steeper  profiles  appear  to  give  more  linear 
behavior  near  pinch-off— an  important  consideration  in  the 
design  of  low  noise  devices. 

Typical  values  of  the  small-signal  transconductance  and 
gate-to-source  capacitance  at  zero  gate  bias  are  shown  in  Fig. 
5.  In  the  case  of  recessed  gate  device,  it  can  be  seen  that  the 
cutoff  frequency  fT(fr  =  gm/(2*Cel))  increases  with  gate 
recess  depth.  This  effect  can  be  explained  as  follows:  as  the 
distance  between  the  gate  and  the  active  layer  implant  peak 
decreases,  a  smaller  gate  voltage  swing  is  required  to  affect  a 
given  drain  current  change  [2),  thus  increasing  the  transcon¬ 
ductance  gm.  At  the  same  time  gate-to-source  capacitance  C* 
decreases  because  the  gate  depletion  layer  extends  beyond  the 
implant  peak  as  the  recess  becomes  deeper.  These  effects 
combine  to  produce  an  increase  in  fj.  For  very  deep  recesses 
(i.e.,  very  small  gate-to-implanl  peak  separation)  the  transcon- 
ductancc  begins  to  decrease,  and  the  cutoff  frequency  fr 
saturates  at  about  24  GHz.  Although  no  graphical  data  are 
shown  here,  somewhat  different  results  are  obtained  for  gate 
bias  voltages  near  pinch-off,  where  the  depletion  layer  is 
always  deeper  than  the  implant  peak.  It  is  found  that  as  the 
recess  increases,  C,s  increases  rather  than  decreases  as  it  docs 
in  the  low  gate  bias  case. 

The  results  for  planar  devices  obtained  by  varying  gate-to- 
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Fig  5  Calculated  results  of  transconductancc  and  gate -to -source  capaci¬ 
tance  for  Figs  2  and  3 


peak  distance  show  similar  effects.  The  larger  at  high  drain 
current  for  the  recessed  gate  structure,  compared  to  the  planar 
structure,  is  caused  by  the  thicker  active  regions  under  the 
source  and  drain  in  the  recessed  gate  structure. 

V  Conclusions 

A  two-dimensK  na!  Unite  clement  method  has  been  used  to 
characterize  ion- implanted  GaAs  MESFFT's  This  simulation 
allows  accurate  modeling  of  changes  in  device  geometry, 
including  the  locaiion  of  both  the  channel  implant  and  the  gate 
recess.  Results  of  the  model  have  shown  that  FET'.s  with  a 
shallow  recess  show  similar  behavior  to  those  with  a  deep 
implant,  i.e..  an  improvement  in  linearity,  a  higher  pinch-off 
voltage,  a  decrease  in  transconductancc.  and  a  corresponding 
decrease  in  cutoff  frequency.  Depending  on  the  device 
application,  different  choices  for  these  parameters  would  be 
appropriate  Thus  this  simulation  could  be  useful  as  a  guide  in 
the  fabrication  of  optimized  devices.  Further  calculations  are 


in  progress  to  determine  bias-dependent  parameter  changes, 
which  are  especially  important  in  large-signal  applications 
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A  statistical  method  for  modeling  the  linear  and  quadratically  nonlinear  relationship  between  fluctuations  monitored  at  two 
points  in  space  or  time  in  a  turbulent  medium  is  presented.  This  relationship  is  described  with  the  aid  of  linear  and  quadratic 
transfer  functions  and  the  concept  of  coherency  is  extended  to  quantify  the  goodness  of  the  quadratic  model  A  unique  feature 
of  the  approach  described  in  this  paper  is  that  it  is  valid  for  non-Gaussian  “input"  and  “output”  signals.  The  validity  of  the 
approach  is  demonstrated  with  simulation  data.  The  method  is  applied  to  experimental  data  taken  in  the  turbulent  edge  plasma 
of  the  TEXT  tokamak  The  results  indicate  a  three  wave  process  with  energy  transfer  to  large  scale  fluctuations.  The  estimation 
of  transfer  functions  is  a  first  step  in  quantitatively  measuring  coupling  coefficients  and  the  energy  transfer 


1.  Introduction 

Many  fluctuation  phenomena  in  nature,  as  well 
as  in  various  technical  problems,  can  be  reduced 
to  relatively  simple  systems  which  are  describable 
by  a  set  of  source  (“input”)  signals  and  the  re¬ 
sponse  (“output”)  signals  of  the  system.  The  sys¬ 
tem  can  then  be  considered  as  a  “black  box”.  By 
modeling  this  “black  box”  by  an  appropriate  net¬ 
work,  consisting  of  linear,  quadratic  and  higher- 
order  nonlinear  elements,  it  is  possible  to  gain 
considerable  insight  into  the  dynamics  of  the  sys¬ 
tem  under  test. 

The  simplest  such  network  consists  of  a  single 
input,  single  output  system  which  is  purely  linear, 
since  the  nonlinear  contributions  of  the  system  are 
assumed  to  be  negligible.  The  determination  of  the 
linear  system  model  from  the  measured  input  and 
output  signals  is  based  upon  cross-correlation 
techniques.  Applications  of  the  linear  network  led 
for  example  to  the  estimation  of  the  dispersion 
relation  in  plasmas  [1-4].  It  is  also  often  used  to 
test  complex  electronic  circuits.  In  this  work  we 
will  discuss  a  system  which  can  be  described  by  a 


single  input  and  single  output  which  will  be  mod¬ 
eled  in  the  spatial  or  temporal  frequency  domain 
by  linear  and  quadratic  elements  of  the  form 

Yp  =  LpXp  +  \  £  Qr>r>XpXp2  +  ep.  (1) 

Pi -Pi 
P-P  i  +P  2 

Such  a  system  is  presented  schematically  in  fig.  1 . 
Lp  and  Qp,  P}  are  usually  called  linear  and 
quadratic  transfer  functions  and  are  generally 
complex  quantities.  The  Fourier  transforms  of  the 
measurable  input  signal  x(s)  and  of  the  output 
signal  y(s)  are  Xp  and  Yp,  respectively.  The  sig¬ 
nals  x(s)  and  y(s)  are  assumed  to  be  zero  mean 
stationary  random  processes.  The  error  term  ep  is 
the  Fourier  transform  of  a  process  which  is  as¬ 
sumed  to  be  statistically  independent  of  the  first 
two  terms  of  eq.  (1).  It  can  be  regarded  as  the 
error  due  to  noise  inherent  in  the  measurement  as 
well  as  systematic  errors  not  described  by  linear 
and  quadratic  terms.  The  goal  is  to  estimate  the 
linear  and  quadratic  transfer  functions  from  the 
measured  input  and  output  signals. 
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SYSTEM  MODEL 

Fig  1  Schematic  model  of  the  nonlinear  s>>tem.  given  in  eq. 
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Indeed,  we  have  motivation  for  modeling  vari¬ 
ous  nonlinear  physical  systems  with  such  a 
quadraticallv  nonlinear  equation,  e.g.  eq.  (1).  Ex¬ 
amples  of  applications  include  electronic  net¬ 
works,  electromagnetics  [5,  6],  and  parametric 
excitation  in  mooring  dynamics  [7-9],  in  structural 
vibrations  [10]  and  in  nonlinear  optics  [11],  Char¬ 
acteristic  of  these  examples  is  that  the  spectral 
components  Xp  and  Yp  stand  for  the  temporal 
frequency  spectra.  On  the  other  hand,  many  physi¬ 
cal  phenomena  can  be  described  by  an  equation 
similar  to  eq.  (1)  with  Fourier  components  Xp  and 
Yp  representing  spatial  frequencies,  i.e.  wave  num¬ 
bers.  When  we  consider  the  Navier-Stokes  equa¬ 
tion  for  neutral  fluids  and  Fourier  transform  in 
space,  we  find  a  wave-coupling  equation  which 
describes  the  temporal  change  of  a  spectral  com¬ 
ponent  of  given  wavenumber  due  to  a  linear  mech¬ 
anism  (growth  rate,  dispersion)  and  due  to  three 
wave  coupling  [e.g.,  12,  13],  The  linear  and  qua¬ 
dratic  nonlinear  terms  in  this  equation  can  be 
modeled  by  eq.  (1).  Similiar  wave  coupling  equa¬ 
tions  describe  the  turbulent  behavior  in  plasmas 
[14,  15]  and  in  the  solid  state  [16].  In  several  of 
these  fields,  competing  theoretical  models  exist  to 
describe  the  formation  of  the  turbulent  spectra. 
Therefore,  it  is  useful  to  have  an  experimental 
measurement  available  with  which  the  competing 
theories  can  be  compared  directly.  In  the  past  we 


reported  preliminary  experimental  results  which 
estimate  the  coupling  coefficient  and  energy 
cascading  in  such  a  turbulent  plasma  [17).  The 
estimation  of  the  nonlinear  transfer  function  is  a 
necessary  first  step  to  quantifying  the  coupling 
coefficients  and  the  energy  transfer.  The  approach 
has  not  yet  been  published. 

Note  that  eq.  (1)  is  the  simplest  variant  of  an 
equation  which  can  be  related  to  wave-wave  cou¬ 
pling.  as  we  assume  that  four-wave  coupling  and 
higher  order  processes  are  much  weaker  than 
three-wave  coupling.  In  some  cases  when 
three-wave  nonlinear  coupling  is  forbidden  by  the 
dispersion  properties  of  the  system,  as  for  example 
for  surface  gravity  waves  in  water  [18],  higher-order 
terms  must  be  included. 

The  determination  of  the  transfer  functions  Lp 
and  Qp'  P:  is  straight  forward  and  well  known  for 
the  case  of  a  Gaussian  input  signal  x(s)  [19-21]. 
Such  a  method  can  be  used  for  analyzing  a  system 
that  can  be  excited  externally  by  a  Gaussian  noise 
source  [20]  and  for  systems  with  input  signals 
which  can  be  assumed  to  be  Gaussian  [21],  Many 
systems  such  as  turbulent  fluids  and  plasmas,  how¬ 
ever.  do  not  allow  such  a  restrictive  assumption  for 
the  input  signal.  In  order  to  obtain  insight  into  the 
physical  mechanisms  of  turbulence  in  these  media, 
one  can  monitor  the  fluctuations  at  two  points  in 
time  or  space  and  study  the  change  of  the  spectra 
between  these  points.  In  these  cases  and  in  general 
the  input  should  not  be  considered  to  be  a  Gauss¬ 
ian  process  because  of  nonlinear  history  of  the 
fluctuations. 

The  main  objective  of  this  work  is  to  present  a 
technique  which  enables  one  to  estimate,  in  an 
efficient  way.  the  linear  and  quadratic  transfer 
functions  from  the  measured  fluctuation  signals 
x(s)  and  y(.s ).  These  transfer  functions  serve  as 
the  fundamental  quantities  with  which  to  estimate 
the  growth  rate,  the  wave-wave  coupling  coeffi¬ 
cients  and  finally  the  energy  transfer  between 
different  spectral  components  [17],  This  paper 
focuses  on  the  technique  of  estimating  the  transfer 
functions  and  qualitatively  describing  the  physical 
results.  A  subsequent  paper  will  cover  the  applica- 
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tion  of  this  concept  to  estimating  quantitatively 
the  coupling  coefficient  and  energy  cascading. 

In  section  2  we  present  a  new  technique  to 
estimate  transfer  functions.  For  the  case  of  non- 
Gaussian  input  signals  the  usual  definition  of  the 
coherency  must  be  generalized.  The  influence  of 
noise  and  of  systematic  errors  will  also  be  briefly 
discussed.  As  the  approach  involves  rather  exten¬ 
sive  computation,  the  convergence  and  accuracy  of 
the  model  must  be  tested  prior  to  applying  it  to 
real  data.  We  present  the  results  of  such  a  test  in 
section  3.  In  section  4.  the  method  is  applied  to 
data  from  turbulence  measured  in  the  edge  plasma 
of  the  TEXT  tokamak.  We  estimate  the  linear  and 
quadratic  transfer  functions  between  two  points  in 
space  and  interpret  them  in  physical  terms.  Next 
we  compare  the  linear  transfer  function  with  the 
one  obtained  by  neglecting  nonlinear  contribu¬ 
tions.  Such  a  comparison  is  of  interest  since  most 
present  correlation  methods  are  based  on  a  linear 
assumption. 

2.  Method 

The  transfer  functions  for  the  quadratic  nonlin¬ 
ear  system  given  by  eq.  (1)  can  be  derived  in  a 
straightforward  way  when  a  sufficient  number  of 
independent  realizations  of  the  spectra  Xp  and  )' 
are  available.  The  variable  p  represents  for  exam¬ 
ple  the  wavenumber  or  temporal  frequency. 

We  first  rewrite  eq.  (1)  by  using  the  symmetry 
relation.  QPi  P:  =  Qp  />|  (Note  that  the  spectral 
components  XPi  and  X  are  interchangeable  in 
eq.  (1)).  It  is  therefore  sufficient  to  sum  the 
quadratic  terms  over  the  frequency  components 
with  p,  >  p2. 

Yp  =  LpXp  +  E  +  V  <la> 

P\  ■  P: 

P-Pi  •  r: 

Multiplying  eq.  (la)  with  the  complex  conjugate  of 
the  input  signal  Xp  and  computing  the  expected 
value  by  ensemble  averaging  (  )  over  many  statis¬ 


tically  similar  realizations,  we  obtain 

<>;*;>-  E  Qpp'  p:(x;xpxp;)  -  (fpx;) 

I _ P\  ^  Pi _ 

(xpxp) 

P  =  P\+Pz  (2) 

Similarly,  by  multiplying  Y  by  X'*.X*,  and  en¬ 
semble  averaging,  we  have 

<Yrx;xP*y  =  Lp(xPxPrxpV 

+  E  Qp'p:(xpxp:x;x;.)  +  (epx;.x;,). 

Pi  -  Pi 

(3) 

with 

P  =  P\  +P:=P\  +P2- 

To  simplify  eq.  (3)  and,  therefore,  reduce  the 
computation  time  we  approximate  the  fourth-order 
moment  (XptXpX*X*.y)  with  second-order  mo¬ 
ments  A^J2)  by  neglecting  components 

(Xp,Xp:Xp\XpV  with  <Pi' Pi)-  How‘ 
ever.  we  retain  the  third-order  moment  of  the 

input  signal  ( Xp  .P,X*X*;),  as  we  assume  that 
the  input  signal  is  non-Gaussian  Such  an  ap¬ 
proach  was  proposed  by  Millionshchikov  [22]  and 
is  used  to  close  the  system  in  weak  turbulence 
theories  [12  13).  This  approach  was  also  used  in 
many  strong  turbulence  theories  [23,  24],  in  which 
the  linear  mode  structure  is  strongly  altered  by  the 
turbulence  and  for  which  Au/uj  is  large.  Compari¬ 
son  with  simulations  which  do  not  use  such  a 
closure  scheme  confirmed  the  validity  of  this  ap¬ 
proximation  also  in  strongly  turbulent  cases 
[24,  25].  Approximating  the  fourth-order  moments 
requires  that  the  fluctuations  must  be  close  to 
Gaussian  distributed,  a  condition  which  is  not 
generally  valid  for  strong  turbulence.  In  a  future 
work  it  is  planned  to  compute  the  complete  set  of 
fourth-order  moments  and  solve  for  the  transfer 
functions.  This  will  allow  us  also  to  test  the  valid¬ 
ity  of  the  closure  scheme  on  real  data.  Using  the 
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Millionshchikov  hypothesis  we  have 


=  ■'  >•„  x;,  -V; )  -  LP(  *,  X*  x*  )-(ep  x*  x*  > 
<iAp  Apj’> 

P  =  P\+Pi-  (3a) 

The  second  term  in  the  numerator  of  equations 
(2)  and  (3a)  vanish  for  a  Gaussian  input  because 
then  the  skewness  {(xl(s))/(x2{s)/'2)  is  zero, 
therefore  (  Xpi  .p_X*X*,)  =  0.  The  second  terms 
can  thus  be  interpreted  physically  as  corrections 
due  to  the  nonlinear  ..istory  of  the  fluctuation 
detected  at  the  input.  As  long  as  the  error  term  ep 
is  zero  mean  and  statistically  independent  with 
respect  to  the  input  signal,  the  error  terms  in  eqs. 
(2)  and  (3a)  vanish. 

The  set  of  dependent  equations  (2)  and  (3a)  can 
be  solved  iteratively  to  find  the  transfer  functions 
Lp  and  Qp‘  P:.  As  an  initial  guess  we  neglect  the 
quadratic  contribution  in  eq.  (2)  and  find  for  the 
linear  transfer  function 

.  <YrXn 

Lr-(XpX?)-  (2a) 


The  number  of  iterations  used  for  a  given  frequency 
p  depends  on  the  accuracy  of  the  estimated  mo¬ 
ments.  It  is  also  sensitive  to  the  stationarity  of  the 
signal  to  be  analyzed  and  is  thus  larger  for  real 
data  than  for  simulated  data.  The  number  of  itera¬ 
tions  is  also  dependent  on  the  magnitude  of  the 
quadratic  transfer  function  with  respect  to  the 
linear  one.  The  typical  number  of  iterations  needed 
for  Lp  to  change  by  8  less  than  1  %c  is  of  order  5 
for  simulation  data  and  of  order  10  for  measured 
data  (6<  \%).  Note  that  the  special  case  of  a 
Gaussian  input  signal  can  be  solved  directly 
without  an  iterative  procedure  as  the  first  term  on 
the  right-hand  side  of  eq.  (3)  vanishes. 

As  indicated  by  equations  (2)  and  (3a).  I.p  and 
Qp'  p  can  be  obtained  by  determination  of  the 


well-known  statistical  quantities  in  the  pointed 
brackets,  the  auto-power  spectrum  Pp  =  (X  X*), 
the  cross-power  spectrum  Cp=  (  Yp  X*  ) .  the  auto¬ 
bispectrum  Bd(px,  p2)  =  (Xpi,P:X*X*:)  and  the 
cross-bispectrum  £c( />,.  p2)  =  (Ypi.p, X*X/). 
The  cross-power  spectrum  measures  the  statistical 
dependence  of  amplitude  and  phase  of  the  same 
spectral  components  in  the  input  and  output. 
Therefore  the  cross-power  spectrum  must  play  an 
important  role  in  detecting  the  dispersion  relation 
and  the  growth  rate.  The  auto-  and  cross-bispectra 
measure  the  statistical  relationship  of  amplitude 
and  phase  between  the  spectral  components  px,  p2 
and  P=P\+P2-  ^  the  waves  at  px,  p2  and  p 
have  statistically  independent  random  phases,  then 
the  resulting  (bi-)phase  of  the  polar  representation 
(6  -  6pi  -  6p  )  will  be  random  and  the  ex¬ 

pected  value  of  the  bispectra  converges  to  zero.  If. 
however,  a  coherent  phase  relationship  exists  due 
to  nonlinear  coupling  of  these  waves,  the  bispec¬ 
tra.  averaged  over  many  realizations,  will  reach  a 
finite  value.  The  auto-  and  cross-bispectra  play, 
therefore,  an  important  role  in  detecting  three- 
wave  coupling  effects.  The  mathematical  and  stat¬ 
istical  background  of  the  estimation  of  these  ex¬ 
pected  values  with  the  aid  of  digital  signal 
processing  are  thoroughly  discussed  in  [26-28]. 

The  amount  of  information  gained  from  the 
transfer  functions  can  be  quite  large.  Therefore  it 
is  useful  to  introduce  normalized  quantities  to 
provide  an  easier  interpretation  of  the  input -out¬ 
put  relationship.  A  convenient  normalization  is 
the  coherency,  which  gives  the  fraction  of  the 
power  in  the  output  signal  which  can  be  accounted 
for  by  the  linear  and  the  quadratic  transfer  func¬ 
tion  model.  To  define  the  coherencies  in  terms  of 
the  transfer  functions,  we  multiply  eq.  (la)  by  its 
complex  conjugate,  take  a  statistical  average  and 
divide  the  result  by  the  output-power  ( YpY *).  We 
find 


1  =  Y{(  P)  +  YqIp)  +  yIq(p) 


+  y/  p)  +  (error  terms). 
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Tflfl-  lL,'r\yr) ' 

Til p)-  I 


YCy(  P)  = - 

Yn(p)  <W> 


2Re(^  I  \Qp'  P'Y(XrX*,Xp:) 


<w> 


where 


P=Pi+  P: 


zero  and  unity.  The  "goodness  of  fit"  of  the  model 
can  then  be  characterized  by  the  total  coherency 
of  the  model. 

y:(  P)  =  Yr(  P)  +  />)  +  Ylq(  p)  (5) 

Note  that  the  above  definition  of  the  coherency 
converges  to  the  commonly  used  definition  [19,  20] 
when  the  input  signal  is  Gaussian.  In  this  case 
(XPi.p,  X*  X^ )  =  0  and  the  term  y[Q(  p)  vanishes. 
We  find 

yLip)-  (rv;)(yv;)' 


YqI  p) 


y  WWWl 


The  linear  coherency  y[(p)  and  the  quadratic 
coherency  Yq(  p )  denote  the  fraction  of  output 
power  accounted  for  by  the  linear  and  quadratic 
transfer  functions.  The  value  under  the  summa¬ 
tion  in  Yq(  p )  is  called  the  cross-bicoherency 
and  measures  the  portion  of  the  power 
of  the  output  signal  Yp.p  ,p.  which  is  phase  locked 
with  Xp  and  Xpi  in  the  input  signal.  The 
coherency  y[Q(  p )  gives  the  portion  of  the  output 
power,  for  which  the  response  due  to  the  linear 
transfer  and  due  to  the  quadratic  transfer  are 
correlated.  Note  that  this  term  arises  because  of 
the  non-Gaussian  input  jc(s).  The  term  y£(p) 
represents  the  ratio  of  the  output  power  due  to 
noise,  which  can  not  be  accounted  for  by  the 
system.  The  error  terms  in  eq.  (4a)  vanish  as  long 
as  the  error  term  ep  of  the  output  signal  is  zero 
mean  and  independent  of  the  input  signal  of  the 
system.  The  usual  definitions  of  the  coherencies 
[19]  are  bounded  by  zero  and  unity  (as  we  shall 
show  below  for  the  case  of  a  Gaussian  input 
signal).  However,  the  individual  coherencies 
y,’(  p).  Yq<  p)  and  Ylq(  P)  defined  above  are  not 
necessarily  bounded  by  unity.  While  yl(p)  and 
Yy(  p )  must  be  larger  than  zero,  the  term  Yi2q(  P ) 
is  allowed  to  take  on  negative  values  as  well  The 
noise  term  y„:(  P)  can  take  on  any  value  between 


y[Q(p)  =  o. 

where 

P=P  i  +  P:- 

The  linear  coherency  y[(  p)  is  bounded  by  zero 
and  unity  as  can  be  shown  with  the  Schwartz 
inequality.  Because  of  eq.  (4a).  Yq(  p)  must  also  be 
bounded  by  unity.  The  application  of  eq  (6)  to 
experimental  data  with  a  non-Gaussian  input  sig¬ 
nal  will  thus  lead  to  an  erroneous  result  and  can 
yield  values  for  the  total  coherency  which  are 
greater  than  unity. 

3.  Simulation  test 

To  test  the  validity  of  the  approach  described  in 
the  previous  section,  we  have  carried  out  a  com¬ 
puter  simulation.  We  start  with  analytically  de¬ 
fined  linear  and  quadratic  transfer  functions  For 
given  non-Gaussian  input  signals  we  compute  the 
corresponding  output  signals.  Next,  we  apply  the 
method  of  section  2  to  estimate  the  linear  and 
quadratic  transfer  functions  from  the  input  and 
output  data,  and  then  we  compare  the  results  w  th 
the  expected  values. 
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P=Pi+P:-  •  =  v  -  1 

The  magnitude  and  shape  of  the  quadratic  trans¬ 
fer  l unctions  defined  above  are  chosen  arbitrarily, 
but  are  realistic  in  that  the  values  are  of  the  same 
order  of  magnitude  as  the  ones  predicted  by  the 
Hasegawa  Mima  equation  (15].  The  linear  trans¬ 
fer  function  is  defined  such  that  the  input  and 
output  spectra  are  similar  in  shape  as  would  be 
expected  for  a  stationary  state  The  Nyquist 
frequency  shall  be  abbreviated  as  pNvq*  The  real 
and  imaginary  parts  of  Lp  are  illustrated  in  fig.  2a. 
2b  and  the  absolute  value  of  Qpp'  is  shown  in  a 
contour  plot  in  fig.  3a  To  approximate  the  situa¬ 
tion  which  actually  occurs  in  a  continuous  medium, 
we  consider  five  identical  “black  boxes"  of  the 
type  shown  in  fig.  1.  which  are  connected  in  series. 
A  Gaussian  signal  is  applied  to  the  input  of  the 
first  black  box.  The  output,  which  is  now 
non-Gaussian  because  of  the  nonlinear  nature  of 
the  black  box,  becomes  the  input  to  the  second 
black  box  and  so  on.  For  the  simulation  we  utilize 
the  input  and  output  of  the  fifth  black  box.  The 
input  signal,  Xp,  for  the  estimation  can  thus  be 
assumed  to  be  approximately  as  “non-Gaussian” 
as  the  output  signal  Yp. 

In  a  second  step  we  estimate  the  linear  and 
quadratic  transfer  functions  Lp  and  Qp'Pl  using 
the  approach  of  section  2.  As  shown  in  figs.  2c,  2d 
and  fig.  3b,  the  estimated  transfer  functions,  are  in 

*Thc  maximal  frequency  which  can  be  resolved  digitally  is 
given  bv  the  Nvquist  theorem  The  theorem  indicates  that  the 
smallest  detectable  period  must  contain  at  least  two  sampling 
points  Frequencies  larger  than  pNvq  produce  an  erroneous 
spectrum  below  pNvq  due  to  aliasing 


0  S  10  5  1 


p/pwyq  "^Nyq 

Fig  ;  Comparison  of  the  analytically  defined  linear  transfer 
function  with  the  reconstructed  ones:  a)  shows  the  real;  and 
b)  the  imaginary  component  of  the  "true”  values;  c)  and  d)  the 
estimated  ones 

good  agreement  with  the  true  values.  The  symbol, 
denotes  an  estimator.  To  save  space,  we  com¬ 
pare  here  only  the  absolute  value  of  the  quadratic 
transfer  function.  The  phase  comparison  would 
show  a  similar  agreement.  Note,  however,  that  the 
phase  information  is  also  important  for  the  inter¬ 
pretation  of  the  nonlinear  system.  Because  of  the 
symmetry  properties  possessed  by  the  quadratic 
transfer  function,  it  is  not  necessary  to  plot  QPl  Pi 
over  the  entire  two-dimensional  planef.  Fig.  3 
gives  the  value  of  \QP,  P2\  at  the  frequency  p  =  px 
+p2  due  to  wave-wave  coupling  with  />,  and  p2. 
The  area,  for  which  the  transfer  function  is  plotted, 
can  be  subdivided  into  three  regions  with  essential 
differences  in  the  physical  content  (I,  II,  III  in  fig. 
3a).  The  triangular  region  (I)  gives  the  quadratic 
transfer  function  at  the  highest  frequency  p  in¬ 
volved  in  the  interaction  (p>  px.  p2).  Region  (II) 

til  is  sufficient  to  compule  and  plot  the  transfer  function  for 
Px  >  P;  (because  QPl  ’’2  ~  QP:  p‘)  and  for  p  >  0  (because 
QPl  Pl  “IP  JJ1'  Pll*.  as  the  Fourier  transform  for  real  data 
v(  i )  satisfies  \p  —  X*  p) 
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Fig  3  Contour  plot  of  the  amplitude  of  a)  the  analytically  defined  quadratic  transfer  function  Point  A  illustrates  IQ?'  P*I  at 
Pi  /Pn,,  ”  0.78  due  to  coupling  with  p;/pNyq  -  0  22  and  p/pNvq  -  0  56;  b)  reconstruction  of  \Qf'-pi\  by  the  iterative  method  The 
contour  interval  is  005  in  a)  and  006  ir  b) 


gives  the  strength  of  the  coupling  at  the  inter¬ 
mediate  frequency  p  due  to  wave-wave  coupling 
with  a  spectral  component  of  larger  frequency  p, 
and  one  with  a  smaller  frequency  p2(px>p> 
|pi|).  The  triangular  region  (111)  shows  \Q£"Pl\  at 
the  lowest  frequency  component  p  due  to  cou¬ 
pling  with  the  two  others  (p  <px,\p2\)-  For  a 
convenient  graphical  representation  of  the  three 
coupling  regions,  we  plot  positive  and  negative 
values  of  p2.  Note,  however,  the  positive  and 
negative  spectral  components  are  related  by  X  _p 
=  X*.  To  facilitate  the  interpretation  of  fig.  3,  we 
arbitrarily  pick  out  one  point,  which  is  indicated 
as  A.  The  number  of  contours  at  this  point  gives 
the  amplitude  of  the  transfer  function  \QP,'P2 1  at 
frequency  p/p  Nyq  =  0.56  as  a  result  of  its  coupling 
with  a  higher  frequency  component  px/p Nyq  = 
0.78  and  with  a  lower  frequency  component 

l/>2l//>Nyq  =  0.22. 

To  test  the  “goodness  of  fit”  of  this  simulation, 
we  compute  the  coherency  from  eq.  (4b).  The 


difference  between  the  estimated  total  coherency 
and  unity  can  be  regarded  as  due  to  systematic 
errors  of  the  estimation  approach  and  due  to  the 
variance  of  the  estimator  of  the  statistical  quanti¬ 
ties  (eg.  the  cross-bispectrum).  Fig.  4a  shows 
the  contribution  of  the  linear,  quadratic  and  mixed 
coherency  to  the  total  coherency  of  the  model 
y2(p).  We  have  chosen  relatively  small  values  of 
the  quadratic  transfer  function  with  respect  to  the 
linear  component  in  order  to  simulate  a  realistic 
situation;  consequently  the  contribution  of  Yq(  p) 
is  small.  Note  that  the  total  coherency  of  the 
model  y2(p)  is  close  to  unity  indicating  that  the 
systematic  error  of  the  approach  is  small.  We 
conclude  that  the  iterative  method  is  able  to  pro¬ 
duce  a  good  fit  of  the  data. 

To  illustrate  the  necessity  of  applying  the  itera¬ 
tive  method,  we  compute  the  transfer  functions 
with  the  usual  method  by  disregarding  the 
non-Gaussianity  of  the  input  signal,  Xp.  The 
non-iterative  method  then  leads  to  a  quadratic 
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Frequency  p/pNyq 


Fig.  4.  Coherence  spectra,  y lip)  linear  component.  yq( p)  quadratic  contribution.  y^q( p )  mixed  term.  y'(p)  summation  over  the 
three  components  The  difference  between  y 2{p)  and  1.0  represents  the  error  term  y£(  p).  a)  gives  the  coherence  for  the  iterative 
procedure  (eq.  (4a.  b)(;  b)  the  one  computed  with  the  usual  definition  (eq.(6)). 


transfer  function  which  is  much  different  from  the 
true  one.  This  erroneous  result  can  easily  be  visual¬ 
ized  by  applying  the  usual  definition  of  the 
coherency,  given  by  eq.  (6),  which  leads  to  values 
for  the  total  coherency  much  larger  than  unity  (fig. 
4b).  The  systematic  error  introduced  by  assuming 
a  Gaussian  input  signal  can  thus  be  considerable. 
Note,  for  the  non-Gaussian  case,  the  linear 
coherency  is  very  close  to  unity.  This  is  due  to  the 
very  small  change  of  the  signal  between  the  input 
and  output  as  a  result  of  the  quadratic  interaction 
(which  is  of  order  y q(p)  in  fig.  4a). 

It  is  important  to  recognize  that  a  good  estima¬ 
tion  of  the  quadratic  transfer  function  requires  a 
large  number  of  realizations.  For  this  simulation 
we  have  used  2000  realizations.  This  large  number 
of  realizations  is  necessary  in  order  to  estimate  the 
auto-  and  cross-bispectra  accurately  and  thus  to 
assure  a  proper  convergence  of  the  iterative  pro¬ 


cess.  To  illustrate  this,  we  show  in  fig.  5  the 
convergence  of  both  the  power  spectrum  Pp  and 
the  amplitude  of  the  auto-  and  cross-bispectra  to  a 
stable  value  as  a  function  of  an  increasing  amount 
of  realizations.  We  have  chosen  for  this  demon¬ 
stration  the  frequency  components  p/p  Nyq  =  0.47, 
Pl/>Nyq“0-31.  and  A’2//»Nyq  =  016-  Whi,e  the 

auto-power  spectrum  stabilizes  after  500  reali¬ 
zations,  more  than  1500  realizations  for  the  auto- 
and  cross-bispectra  are  needed.  The  large  variance 
of  the  higher-order  spectra  can  be  attributed  to 
the  fact  that  all  spectral  components  in  a  turbu¬ 
lent  spectrum  couple  with  each  other  and  there¬ 
fore  the  contribution  of  each  triplet  of  waves  is 
relatively  smalt. 

By  testing  the  iterative  method  under  different 
conditions,  we  have  found  another  interesting  fea¬ 
ture  which  we  briefly  report.  For  a  purely  Gauss¬ 
ian  input,  Xp,  the  iterative  nrocedure  appears  to 
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Fig.  5.  Convergence  of  the  spectraJ  estimates  to  a  stable  value  as  /unction  of  the  number  of  realizations,  a)  Convergence  of  the  power 
spectrum  at  p/pNyq  -  0.47;  and  b)  of  the  amplitude  of  the  auto-  and  cross-bispectra  at  p/pNyq  -  0  47  due  to  wave-wave  coupling 
with  p,/pNyq- 0.16  and  P,/pNyq  -  0.31. 


considerably  reduce  the  number  of  realizations 
needed  for  a  given  statistical  error  of  the  quadratic 
transfer  function.  While  the  auto-  and  cross-power 
spectra  are  already  estimated  acceptably,  the 
estimate  of  the  bispectra  is  significantly  poorer  for 
the  same  amount  of  realizations.  Because  of  a 
similar  behavior  of  the  auto-  and  cross-bispectra 
(see  their  convergence  in  fig.  5),  the  statistical 
errors  therefore  approximately  cancel  in  eq.  (3a) 
and  the  estimation  of  Q£'  Pl  is  improved  for  the 
same  amount  of  realizations.  We  conclude  that  the 
iterative  procedure  is  valuable  to  increase  the  speed 
of  the  calculation  for  systems  with  a  purely  Gauss¬ 
ian  input  signal  and  also  requires  fewer  realiza- 


4.  Nonlinear  wave  coupling  in  the  edge  of  a  tokamak 


The  plasma  which  connects  the  hot  interior 
plasma  of  a  tokamak  with  the  cold  wall  is  often 
called  edge  plasma.  This  region  of  the  tokamak  is 
characterized  by  steep  density  and  temperature 
gradients  and  by  a  high  density  fluctuation  level. 
Asymmetries  induced  by  the  limiter  and  an 
increased  impurity  level  due  to  the  plasma  wall 
contact  make  this  region  even  more  complex.  A 
good  understanding  of  the  physics  of  the  edge 
plasma  in  a  tokamak  can,  however,  be  crucial  for 


fusion,  as  the  fluctuations  in  the  edge  can  substan¬ 
tially  affect  the  global  plasma  confinement.  The 
study  of  the  linear  and  nonlinear  behavior  of  the 
waves  and  instabilities  in  the  edge  plasma  can  add 
important  information  which,  together  with  theo¬ 
retical  models,  lead  to  optimized  edge  conditions 
with  minimized  particle  and  energy  fluxes  to  the 
wall. 

The  linear  features  of  fluctuations  in  the  edge  of 
tokamak  plasmas  have  been  studied  for  some  time 
with  probes  [e.g.  1-4,  29-31).  In  the  following  we 
extend  this  observation  and  give  information  about 
the  nonlinear  wave  interaction  by  the  estimation 
of  the  nonlinear  transfer  functions. 

To  make  a  direct  comparison  with  theoretical 
works  on  turbulence  in  plasmas  one  is  most  inter¬ 
ested  in  the  transfer  function  for  wavenumber 
spectra.  Such  spectra  are,  however,  not  easily  ob¬ 
tainable  in  tokamak  plasmas  as  this  would  neces¬ 
sitate  a  large  number  of  spatial  samples  with 
probes  or  a  simultaneous  measurement  of  many 
spectral  components  with  scattering  technique.  In 
a  tokamak  the  installation  of  a  Langmuir  probe 
array  has  recently  been  realized  [4],  A  large  probe 
array  may  substantially  perturb  the  plasma.  Multi¬ 
channel  scattering  experiments  are  able  to  mea¬ 
sure  instantaneously  different  wave  numbers.  For 
the  analysis  a  heterodyne  system  would  be  needed 
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and  the  influence  due  to  different  scattering 
volumes  has  to  be  understood.  To  circumvent 
these  technical  questions  we  chose  for  this  initial 
experiment  a  two  probe  technique. 

Two  spatially  separated  probes  are  capable  of 
measuring  temporal  variations  of  the  fluctuation. 
Instead  of  computing  the  bispectra  for  different 
wave  numbers  which  fulfill  the  selection  rule  k  = 
k{  +  k2  we  compute  them  for  the  frequency  com¬ 
ponents  /  =  /[  +  /:.  This  approach  enables  us  to 
detect  only  the  resonantly  coupled  components, 
for  which  the  frequency  mismatch  is  zero.  Those 
coupled  spectral  components  with  frequency  mis¬ 
match.  which  are  present  in  strong  turbulence 
causing  the  broadening  of  the  dispersion  relation, 
contribute  to  the  error  term  in  our  method. 

We  measured  the  signals  with  Langmuir  probes 
in  the  turbulent  edge  region  of  the  TEXT  tokamak 
(major  radius  100  cm,  minor  radius  a  =  21  cm). 
To  measure  the  density  fluctuations,  the  probes 
were  biased  into  ion  saturation  current  and  the 
potential  fluctuations  were  detected  with  floating 
probes.  The  signals  are  digitized  with  a  10  bit 
digitizer  with  32k  words  storage  capability  per 
channel.  Each  time  series  is  subdivided  into  512 
time  segments  of  64  data  points.  To  estimate  the 
transfer  functions  we  have  averaged  over  more 
than  1500  realizations  gathered  from  3  identical 
shots.  The  sampling  interval  used  in  the  following 
presentation  is  1  fis,  which  defines  a  temporal 
Nyquist  frequency  of  500  kHz,  which  is  well  above 
the  dominant  components  of  the  turbulent  power 
spectrum. 

We  will  consider  data  obtained  with  the  follow¬ 
ing  tokamak  parameters.  The  toroidal  magnetic 
field  is  B()  =17",  the  plasma  current  is  /0  =  100 
kA,  the  chord  averaged  density  is  1.0  x  1013  cm  3 
and  we  have  a  peak  electron  temperature  of  600 
eV.  In  the  region  behind  the  limiter  we  observed  a 
broad  turbulent  spectrum  with  fluctuation  levels 
\n/n\  of  up  to  50%.  The  edge  plasma  is  char¬ 
acterized  by  a  nonuniform  radial  electric  field 
which  changes  sign  just  inside  of  the  outermost 
closed  flux  surface.  Behind  this  flux  surface  the 
radial  electric  field  causes  an  Erx  B  plasma  rota¬ 


tion  in  the  ion  diamagnetic  drift  direction  while  it 
results  in  a  rotation  in  the  electron  direction  on 
the  inside.  The  measurements  also  exhibit  a  local¬ 
ized  instability  which  occurs  in  the  region  of  maxi¬ 
mum  veloci'y  shear  and  which  is  different  from 
the  turbulence  structure  outside  of  the  velocity 
shear.  The  measured  phase  velocity  of  the  turbu¬ 
lence  can  be  described  by  an  E,x  B  drift  super¬ 
imposed  on  a  pressure  gradient  drift  [29],  In  this 
work  we  include  a  description  of  nonlinear  effects 
for  the  region  outside  the  shear  layer.  Model  equa¬ 
tions  [23.  24,  32,  33]  applicable  in  the  edge  region 
are  characterized  by  quadratic  nonlinearities, 
hence,  the  three  wave  interaction  is  the  relevant 
wave  coupling  process.  Model  equations  with  cubic 
nonlinearities  [34.  35]  giving  rise  to  four  wave 
interaction  are  generally  not  considered  in  the 
edge  plasma  context. 

Our  goal  is  to  estimate  the  linear  and  quadratic 
transfer  functions  between  the  signals  of  two 
poloidallv  separated  points.  The  experimental 
setup  is  as  follows:  two  radially  movable  Lang¬ 
muir  probes  are  located  at  the  top  of  the  toka¬ 
mak.  They  are  separated  by  Sx  =  3.5  mm.  For  the 
dominant  power  at  low  frequencies,  this  sep¬ 
aration  is  small  compared  to  the  poloidal  correla¬ 
tion  length  (several  centimeters).  In  this  paper,  we 
analyze  data  taken  at  a  radial  position  1  cm 
behind  the  limiter.  The  density  at  this  location  is 
approximately  5.0  x  10"  cm  \  the  electron  tem¬ 
perature  Te  =  10  eV  and  \n/n\  =  40%.  The  power 
spectra  of  the  density  fluctuations  monitored  by 
both  probes  located  at  this  radial  position  are 
shown  in  fig.  6.  Both  probes  yield  nearly  the  same 
spectrum. 

The  data,  which  have  been  digitized  and  stored 
in  the  computer,  are  processed  using  the  procedure 
discussed  in  section  2.  The  propagation  direction 
is  first  determined  by  observing  the  sign  of  the 
phase  shift  of  the  cross  power  spectrum  between 
the  two  probes.  The  signal  of  the  probe  which  first 
samples  the  turbulent  structures  (i.e.  the  “up¬ 
stream"  probe)  is  treated  as  the  input  signal.  The 
“downstream"  probe  signal  is  considered  as  the 
output. 
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Fig.  6.  Power  spectrum  of  the  density  fluctuate  i.o  cm  be-  Propagating  a  distance  Ax.  An  amplitude  smaller 

hind  the  limiter  for  the  two  Langmuir  probes,  separated  by  3.5  than  one  can  be  due  to  a  linear  damping  mecha- 

mm  m  the  poloidal  direction.  nism  or  a  transfer  of  energy  to  other  waves  as  a 

result  of  nonlinear  wave-wave  coupling.  Other 
The  linear  transfer  function  estimated  from  this  reasons  for  the  change  of  the  amplitude  also  exist, 

data  is  presented,  for  easier  physical  interpretation  as  for  example,  the  neglect  of  higher-order  nonlin- 

tn  terms  of  amplitude  and  phase  of  Lf  =  ear  terms,  an  external  source  of  noise  or  a  multidi- 

\Lf\tKp{\dL(f)).  Amplitude  and  phase  are  shown  mensional  behavior  of  the  fluctuation.  In  fig.  7a  we 

in  figs.  7a. b.  The  phase  of  the  linear  transfer  observe  that  \Lf\  is  always  smaller  than  unity  and 

function  0L(f)  gives  the  phase  relationship  be-  drops  off  rapidly  for  frequencies  greater  than  350 

tween  the  input  and  the  output  signal  due  to  kHz.  We  conclude  that  all  spectral  components  are 

linear  effects.  In  our  experiment.  0L(f)  can  be  damped;  however,  we  would  require  an  unphysi- 

related  to  the  mean  dispersion  relation.  ke(f).  cally  large  damping  coefficient  to  describe  the 

because  a  wave  which  propagates  with  wavenum-  damping  at  high  frequencies.  For  more  insight  we 

her  ke(  / ).  will  undergo  a  phase  shift  in  propaga-  look  at  the  “goodness  of  fit”  of  the  model  by 

ting  between  the  two  probes  equal  to  k,(  f ) Ax.  In  plotting  the  coherency  (fig.  8).  The  total  coherency 

fig.  7b  we  give  the  phase  information  as  well  as  the  of  the  model  (eq.  (5)),  y 2(  / ),  is  high  for  spectral 

related  averaged  wavenumber  k,(f)  =  0L(  f)/Ax.  components  up  to  the  frequency  where  the  ampli- 

We  find  an  approximately  linear  dispersion  rela-  tude  of  the  linear  transfer  function  starts  to  de- 

tion  and  thus  a  nearly  constant  phase  velocity  of  crease  rapidly.  The  fast  increase  of  y „(/)  =  1.0  - 

the  wave  over  the  observed  frequency  range.  At  y  2(  / )  for  frequencies  larger  than  350  kHz  demon- 

the  radial  position  discussed  here  (1  cm  behind  the  strate  that  systematic  errors  become  important  at 

limiter),  the  propagation  direction  of  the  phase  high  frequencies.  The  effect  of  instrument  noise 

velocity  is  in  the  ion  diamagnetic  drift  direction.  can  be  neglected,  as  the  power  of  the  signal  is 

The  magnitude,  \Lf\,  gives  the  change  of  the  significantly  larger  than  the  noise  power  for  all 

signal  level  between  the  input  X,  and  the  output  frequencies.  We  believe  that  the  deviation  of  the 

Yf  due  to  linear  effects.  An  amplitude  \Lf\  larger  coherency  from  unity  is  mainly  due  to  the  one- 

than  unity  indicates  a  wave  which  grows  while  dimensional  approach  we  have  chosen.  In  our 
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Fig  8  Coherence  between  the  two  probes 

experimental  set  up,  we  look  only  at  the  poloidal 
component  of  the  fluctuation.  When  a  turbulent 
structure  propagates  at  an  angle  with  respect  to 
the  two  probes  it  may  be  observed  by  only  one  of 
the  probes  and  will  cause  a  decorrelation  between 
the  signals.  This  effect  will  dominate  when  the 
scale  length  and  thus  the  wave  length  of  the  struc¬ 
tures  becomes  very  small,  as  is  the  case  for  high 
frequencies  (see  fig.  7b). 

We  now  turn  our  attention  to  the  quadratic 
transfer  function,  for  which  the  amplitude,  |  Q\' J- 1 , 
is  shown  in  a  three-dimensional  plot  in  fig.  9. 
During  the  propagation  between  the  two  probes, 
only  a  relatively  small  amount  of  the  spectral 
amplitude  of  the  output  signal,  Y,,  is  generated  at 
frequency  /  =  /,  +  f2  >  /,,  f2  due  to  wave-wave 
interaction  with  the  frequency  components  /,  and 
/,  (see  region  (I)  in  Fig.  3a).  The  amount  of  the 
signal  which  is  transferred  to  intermediate  fre¬ 
quencies  due  to  coupling  with  a  higher  frequency 
and  a  lower  frequency  component  is  larger.  The 
largest  values  of  the  transfer  function  occur  in  the 
shaded  region  in  fig.  9,  corresponding  to  region 
(III)  in  fig.  3a.  We  conclude  from  this  that  the 
quadratic  transfer  function  is  strongest  for  the 
dominant  low  frequency  components  of  the  spec¬ 
trum  in  fig.  6  (around  30  kHz)  as  a  result  of  their 


YU 

interaction  with  two  waves  of  similar  frequency. 
The  strength  of  this  wave-wave  coupling  increases 
when  /,  and  /,  approach  the  Nyquist  frequency. 
We  must  keep  in  mind,  however,  that  because  of 
the  low  power  at  high  frequencies  the  absolute 
contribution  of  the  high  frequency  components  to 
the  output  signal  is  small,  despite  a  large  value  of 
the  transfer  function. 

When  we  examine  the  coherency  due  to  qua¬ 
dratic  interaction  of  the  waves  Yq(/)  (fig.  8a)  we 
find  very  low  values.  The  fraction  of  the  total 
power  at  frequency  /  which  originates  from  inter¬ 
action  with  all  frequency  pairs  which  satisfy  the 
selection  rule  /  =  /j  -t-  /2  is  therefore  small  for  a 
probe  separation  of  3.5  mm.  The  fact  that  the 
magnitude  of  the  mixed  term  Ylq(/  ) 's  larger  in 
magnitude  than  Yq(  / )  is  not  surprising.  For  small 
probe  separations  the  turbulent  structures  reach¬ 
ing  the  first  probe  are  still  present  when  they  reach 
the  second  probe  (due  to  the  linear  transfer)  and 
contribute  even  more  to  the  output  signal  than 
that  generated  due  to  quadratic  interaction  be¬ 
tween  the  two  probes. 

Having  discussed  the  various  sources  of  errors 
which  affect  the  “goodness  of  fit”  of  the  model,  we 
have  to  ask  if  the  measured  quadratic  transfer 
function  gives  a  meaningful  result.  This  question 
is  critical,  as  we  know  from  the  coherency  of  the 
quadratic  term  as  well  as  from  theoretical  models 
that  the  quadratic  interaction  is  small.  Also,  the 
variance  of  the  estimate  is  larger  than  that  of  the 
linear  one,  just  as  an  estimate  of  a  two-dimen¬ 
sional  surface  has  more  variance  than  an  estimate 
of  a  one-dimensional  curve,  using  the  same  num¬ 
ber  of  data  points.  We  have,  however,  good  indi¬ 
cation  that  the  quadratic  coefficient  is  qualitatively 
correct.  When  we  look  at  the  scaling  of  the  coeffi¬ 
cients  with  probe  separation,  we  find:  for  an  in¬ 
creased  spacing  of  the  probes,  Yq(/)  increases  and 
Ylq(  / )  decreases.  We  can  expect  such  a  result  for 
a  correct  measurement:  for  an  increased  probe 
separation,  the  interaction  time  for  quadratic 
processes  and  linear  damping  is  longer.  This  ob¬ 
servation  indicates  that  the  quadratic  transfer 
function  is  estimated  correctly. 
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Fig  'i  Three-dimensional  plot  of  the  amplitude  of  the  quadratic  transfer  function,  obtained  from  the  densits  fluctuations  measured 
at  the  two  probes 


How  do  these  results  compare  with  different 
theoretical  predictions?  Fundamental  differences 
are  predicted  theoretically  for  two-dimensional  and 
three-dimensional  turbulence:  in  three-dimen¬ 
sional  turbulence  the  energy  cascades  due  to  vortex 
stretching  from  large  scale  turbulent  structures  to 
small  scale  structures.  In  two-dimensional  turbu¬ 
lence  the  energy  is  predicted  to  cascade,  in  con¬ 
trast  to  the  three-dimensional  model,  to  smaller 
wave  numbers  and  frequencies,  as  the  vorticity  is 
independent  of  the  third  direction.  For  tokamak 
plasmas  with  inherently  strong,  mainly  toroidal, 
magnetic  field  a  two-dimensional  behavior  of  the 
turbulence  is  generally  expected.  The  essentially 
two  dimensional  picture  of  the  turbulence  can  be 
justified  by  the  strong  magnetic  field  which  con¬ 
fines  the  charged  particles  in  the  perpendicular 
direction,  while  the  electrons  and  ions  can  propa¬ 
gate  relatively  freely  along  the  field  lines.  There 
are  many  different  theoretical  turbulence  models 
for  tokamak  plasmas  because  a  variety  of  linear 
instabilities  and  nonlinear  mode  coupling  pro¬ 
cesses  can  be  included  [36],  Our  experiment  shows 


a  dominant  coupling  of  higher  frequency  spectral 
components  in  a  way  as  to  increase  the  amplitude 
at  the  lowest  frequency  component  of  the  triplet. 
The  strongest  effect  is  found  for  coupling  of  two 
spectral  components  of  comparable  frequency  to  a 
very  low  frequency  component.  This  result  indi¬ 
cates  an  efficient  “one  step"  process  between  rela¬ 
tively  small  scale  turbulent  structures  (k>  10 
cm  1 )  to  large  scales  ( k  <,  5  cm'1).  The  experi¬ 
ment  indicates  that  the  turbulence  in  the  edge 
plasma  of  the  TEXT  tokamak  is  essentially  of 
two-dimensional  nature  as  expected  theoretically 
for  tokamak  plasmas. 

We  mentioned  in  the  introduction  that  much 
work  dealing  with  turbulence  is  based  on  linear 
transfer  functions  [e  g.  1  -4,  29).  The  contribution 
of  the  nonlinear  term  in  eq  (2)  can  then  be 
neglected.  The  magnitude  of  the  quadratic  transfer 
function  and.  thus,  the  coherency  Yq(/)  we  ob¬ 
served  suggest  that  such  an  assumption  is,  indeed, 
valid.  The  approximation  can  be  confirmed  quan¬ 
titatively.  When  we  neglect  the  quadratic  terms 
and  cpmpute  the  linear  transfer  function  (eq.  (2a)) 
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we  find  values  which  are  very  close  to  the  ones  for 
the  non-Gaussian  input  case  (dashed  lines  in  tigs. 
7a  and  b).  While  our  results  are  obtained  for 
turbulence  observed  in  the  edge  plasma  of  the 
TEXT  tokamak  with  probes  separated  by  d.t  =  3.5 
mm.  they  suggest  that  the  quadratic  terms  can  be 
neglected  for  similar  experimental  arrangements 
when  one  is  interested  only  in  the  linear  behavior 
of  fully  developed  turbulence  in  the  edge  region  of 
tokamaks.  Note  that  the  neglect  of  the  nonlinear 
terms  in  the  estimation  of  the  linear  transfer  func¬ 
tion  can  lead  to  an  incorrect  result  for  cases  with 
small  linear  component  l.p.  as  in  this  case  the 
contribution  of  the  quadratic  term  to  eq.  (2)  can 
be  considerable. 


5.  Conclusion 

We  have  found  that  the  usual  method  of  esti¬ 
mating  linear  and  quadratic  transfer  functions  be¬ 
tween  two  measurement  points  for  Gaussian  input 
signals  can  be  extended  to  non-Gaussian  input 
signals.  This  extension  leads  to  a  model  with  which 
a  self-excited  and  fully  turbulent  system  cart  be 
studied  experimentally.  The  model  is  tested  with 
simulated  data  and  can  reproduce  the  transfer 
functions.  A  check  of  the  "goodness  of  fit"  shows 
that  the  coherency  is  close  to  unity.  The  true 
output  signal  can  thus  be  modeled  nearly  exactly 
from  the  input  signal  by  using  the  estimated  linear 
and  quadratic  transfer  functions.  Analysis  of  the 
same  signals  with  the  usual  method,  which  as¬ 
sumes  a  Gaussian  input  signal,  gives  nonphysical 
values  of  the  coherency  with  values  larger  than 
unity . 

Using  two  Langmuir  probes  in  the  edge  plasma 
of  the  TEXT  tokamak  we  find:  the  linear  transfer 
function  yields  a  nearly  linear  dispersion  relation 
of  the  waves  with  propagation  in  the  ion  diamag- 
ne'ic  drift  direction.  All  spectral  components  are 
at  least  marginally  damped.  The  amplitude  of  the 
quadratic  transfer  function  is  largest  at  the  do¬ 
minant  low  frequencies  around  30  kHz  due  to 
coupling  with  two  waves  having  comparable  fre¬ 


ni 

quencies  and  lead  to  an  increase  of  the  amplitude 
at  the  low-frequency  component  This  result  indi¬ 
cates  an  efficient  "one  step"  process  between  small 
scale  and  large  scale  structures.  The  portion  of  the 
signal  which  is  generated  between  the  two  probes 
due  to  quadratic  interaction  is  small  with  respect 
to  the  linear  contributions,  but  proves  that  many 
waves  couple  with  each  other  in  a  fully  turbulent 
spectrum.  Our  observation  is  consistent  with  two- 
dimensional  turbulence  models  which  predict  an 
energy  transfer  to  smaller  scale  structures  due  to 
three-wave  coupling,  which  is  in  contrast  to  three 
dimensional  models  with  energy  cascading  to 
larger  scales. 

We  demonstrated  in  this  paper  the  usefulness  of 
the  method  to  estimate  the  linear  and  quadratic 
transfer  function  for  non-Gaussian  inputs.  Al¬ 
though  we  applied  the  method  to  plasma  turbu¬ 
lence  data,  we  emphasize  that  the  method  is  equally 
applicable  to  analyze  other  data,  as  in  fluid  turbu¬ 
lence.  for  example.  In  a  future  work,  we  will 
extend  this  method  to  estimate  the  three-wave 
coupling  coefficients  and  thus  quantify  energy 
cascading  in  turbulence.  Such  experimental  mea¬ 
surements  can  be  useful  for  a  direct  comparison 
with  theoretical  models. 
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A  new  honodyne  spectroscopy  technique  has  been  applied 
for  the  first  time  to  tokamak  mi croturbul ence  measurements 
in  order  to  ascertain  the  frequency  spectra  and  wave 
propagation  direction  of  low-frequency  density 

fluctuations.  This  method  is  employed  m  lieu  of  more 
expensive  end  complicated  heterodyne  detection  schemes 
typically  available  for  far-infrared  laser  scattering 
systems 

Introduction 

Laser  and  millimeter-wave  scattering  techniques  are 
commonly  used  to  study  the  space-time  statistics  of 

electron  density  fluctuations  in  tokamak  and  other  plasmas 
Of  particular  importance  is  the  determination  of  the 
direction  of  propagation  of  the  fluctuations.  Since  the 
scattering  geometry  fixes  the  direction  of  the  scattering 
wave  vector  k.  the  direction  of  propagation  information  is 
carried  by  the  sign  of  the  fluctuation  frequency  w  The 
fact  that  waves  may  be  propagating  both  parallel  and 
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antiparallel  to  k  is  manifested  by  the  presence  of  blue  and 
red  sidebands  centered  around  the  incident  wave  frequency 

"o' 


To  recover  the  propagation  direction  information 
contained  in  the  blue  and  red  sidebands.  heterodyne 
detection  techniques  are  typically  employed  This  approach 
requires  two  coherent  sources,  with  a  frequency  difference 
A cj  .  to  be  utilized  as  the  incident  and  local  oscillator 
beams  After  mixing,  the  frequency  range  of  the  resultant 
signal  i s  Aw  -  w ,  where  u  is  the  frequency  associated  with 
the  plasma  fluctuations.  As  long  as  A u  »  w,  the  blue/red 
sidebands  may  be  resolved.  In  contrast.  if  a  classical 
horaodyne  approach  is  used,  A u  =0,  and  it  l s  no  longer 
possible  to  unambiguously  determine  the  wave  propagation 
direction. 

Realization  of  a  heterodyne  detection  system  in  the 
far-infrared  is  expensive  and  technically  nontrivial. 
Utilization  of  a  rotating  grating  to  frequency  shift  a 
portion  of  the  source  beam  is  a  feasible  alternative 
although  the  frequency  offset  is  limited  to  roughly  A v  $ 
150  kHz  (insufficient  for  microturbulence  measurements 
where  fluctuations  are  observed  up  to  1  MHz).  In  addition, 
there  is  noise  associated  with  the  grating  which  limits 
resolution  near  zero  frequency  and  fabrication  can  be 
costly. 

Experiment, al  Tecbniauc  And  AmaeaLua 

A  considerably  simpler  and  inexpensive  method  proposed 
by  Tsukishima1  and  Asada  et  al . ,  permits  detection  of  the 
wave  propagation  direction  frc=  the  analysis  of  homodyne 
signals.  The  IF  output  of  the  scattered  signal  after  the 
mixer  is  a  real  quantity  described  by 


v(t)  =  Re  \ftZ  T 


where  k+=ks~ 
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laboratory  frame  of  reference  and  thus  N(k+,w)  #  N(k+,-w) 

However.  front  the  real  time  signal  v(t),  one  is  restricted 
to  the  reconstruction  of  the  spectrum  N’(k+,«j)  with  the 
symmetry  property  N'(k+iMi)  =  N'  (k+.-w).  thereby  losing 
wave  propagation  direction  information  The  key  idea  of 
the  new  homodyne  spectroscopy  technique  proposed  by 
Tsukishima*  is  to  reconstruct  the  complex  time  signal  w(  t ) 
and  recover  the  complete  wave  information  by  using  two 
horaodyne  IF  signals  v^t)  and  v2(t)  which  are  phase  shifted 
by  90°  with  respect  to  each  other  permitting  one  to  write 
w(t)  =  Vj(t)  +  i  v2(t)  12 

The  blue  and  red  sidebands  of  the  scattered  radiation. 
S+(<j)  and  S_(u)  where  St(w)  «  [nt(u)J  .  can  be  readily 
calculated  from  the  two  IF  signals  Vjft)  and  v2(t) 


S*(w)  =  (Gjj(<j)  +  022(01))  *  2  Im{Gj2(o>)).  01  >  0. 


where  Gjj(oi)  and  G22(o>)  are  the  auto-power  and  Gjgfoi)  the 
cross-power  spectral  densities  of  the  two  signals  Vj(t)  and 
vg(t).  and  are  given  by 


Glk(oi)  =  <V1(ui)V*(o.)>  .  1  ,k  =  1.2 


The  spectral  component  V((u»)  *  / ( v  ( t))  is  the  Fourier 
transform  of  the  IF  signal  v^t)  and  <  >  denotes  an 
ensemble  average  over  many  statistically  similar 
real  1 zat 1 ons 

In  the  experimental  results  to  be  shown  later,  a  time 
series  of  32k  data  points  (length  of  the  t ime  sample  I  ■  H 
ms  at  a  sampling  rate  of  2  MHz)  was  subdivided  into  128 
realizations  of  286  data  points  (T  ■  122  ms)  By 
eaqiloying  a  fast  Fourier  transform  algorithm,  the  frequency 
spectrum  was  obtained  with  a  resolution  of  Au//2n  *  1/T'  •  8 
kHz  Mixer  and  amplifier  noise  contributions  could  be 
subtracted  from  the  autopower  spectra  although 
signal-to-noise  levels  were  sufficiently  large  so  as  to 
make  it  unnecessary 
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the  use  of  quasi-opt ical  biconmcal  Schottky  barrier  diode 
mixers.  The  frequency  shifted  scattered  radiation  may  be 
collected  over  a  range  of  wave  vectors  from  0  <  k^  <  15 
cm  *  Detailed  information  on  the  scattering  system  and 
calibration  procedures  is  described  by  Park  et  al.4'^  The 
portion  of  the  far-infrared  laser  utilized  as  the  probe 
beam  is  weakly  focussed  along  a  vertical  chord  to  a  waist 
of  =  2  cm  producing  a  measured  wavenumber  resolution  Ak^  = 
M  cm-1.  The  length  of  the  scattering  volume  varies  as  a 
function  of  wavenumber  and  ranges  from  ±8  cm  (e~*  point  of 
scattered  power)  at  k^  =  12  cm-1  to  a  chord  average  as  k  -• 
0.  Scattered  radiation  in  the  plane  perpendicular  to  the 
toroidal  magnetic  field  is  examined. 

At  a  particular  wave  vector  k.,  the  scattered 
radiation  beam  is  divided  equally  into  two  components  which 
are  coupled  into  detectors  1  and  2  by  50  %  reflectivity 
beam  splitters.  Similarly,  the  local  oscillator  beam  is 
equally  divided  to  provide  rf  drive  for  each  mixer.  In  one 
leg  of  the  local  oscillator  beam  (det.  2),  a  phase  shifter 
is  inserted.  This  phase  shifter  consists  of  a  piece  of 
high  density  polyethylene  (excellent  transmission 
characteristics  at  245  GHz)  mounted  on  a  rotation  stage. 
By  tuning  the  rotation  angle,  the  path  of  the  LO  beam 
through  the  polyethylene  is  altered  thereby  changing  its 
phase  with  respect  to  det.  1.  This  phase  shifter  is  tuned 
such  that  there  is  a  90°  phase  difference  between  dets.  1 
and  2.  The  signal  from  each  detector  is  then  amplified  and 
digitized  so  that  the  cross-  and  auto-  power  spectra  may  be 
computed. 


Experimental  Results 

The  above  described  technique  may  now  be  applied  to 
density  fluctuation  measurements  in  a  high  temperature 
tokamak  plasma.  Microturbulence  (low-frequency  density 
fluctuation)  is  driven  by  the  free  energy  associated  with 
plasma  inhomogene i t ies  such  as  density  and  temperature 
gradients.  For  drift  wave  type  fluctuations,  it  is 
predicted  that  the  turbulence  will  exhibit  a  phase  velocity 
-De  =  ‘“'/fc©  =  lkBTe/eBTnel  7ne  x  %/!%!■  where  is  the 
electron  diamagnetic  drift  velocity,  kg  and  w  are  the 
poloidal  wave  vector  and  frequency  of  the  fluctuation.  Rj. 
is  the  toroidal  magnetic  field,  Te  is  the  electron 
temperature,  and  ne  is  the  electron  density  The  geometry 
for  scattering  from  electron  drift  waves  is  shown 
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schema 1 1 ca I  1 y  in  Fig  2  The  scattering  system  is 
positioned  such  that  the  incident  beam  impinges  upon  the 
plasma  from  the  top  of  the  torus  (also  see  Fig  1)  along  a 
vertical  chord  at  the  major  radius  R  =  1  m  This  provides 
for  scattering  from  fluctuations  with  a  poloidal  wave 
vector.  Ijq  Depending  upon  the  orientation  of  the 
collection  optics  with  respect  to  the  incident  beam,  the 
wavenumber  matching  condition  (momentum  conservation)  gives 
ks  =  kQ  t  k  where  &s .  .  k  are  the  wave  vectors  of  the 
scattered  beam,  incident  beam.  and  plasma  fluctuation, 
respectively  It  is  important  to  note  that  for  a 
particular  scattering  geometry,  a  sign  change  will  occur 
(±ll)  when  one  switches  from  the  plasma  top  to  bottom  (see 
Fig  2  (a)  and  (c)  or  (b)  and  (d)) 


(a)  k,  -  k0  +  (b)  ks  »  k0  -  k 


Figure  2  Tokamak  scattering  geometry 


Experimental  results  from  the  TEXT  tokamak  (major 
radius  R  =  lm  and  minor  radius  a  =  27  cm)  are  shown  in  Fig 
3.  for  a  scattering  volume  located  at  the  plasma  bottom 
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with  poloidal  wave  vector  kQ  =  7  c»  1  The  discharge 

parameters  were  lp  =  400  kA  BT  =  28  kG .  and  ne  =  2  x  10^3 
cm  In  Figs  3  (a)  and  (b),  the  homodyne  power  spectra 
Sk(u/2n)  from  the  two  mixers  are  illustrated  Each  is 
characterized  by  a  broad  spectra  which  falls-off  in  power 
for  w/2n  300  kHz  The  homodyne  spectra  provide  no 
information  regarding  wave  propagation  direction  as  the  tu 
components  are  detected  as  |iu|  Now  however.  by 
implementing  the  homodyne  spectroscopy  technique  of 
Tsuk i shiraa  wave  propagation  direction  information  can  be 
ascertained  as  depicted  in  Fig  3(c)  Here,  the 

fluctuations  are  observed  to  possess  a  clear  peak  at  +w/2n 
“  250  i  50  kHz  in  the  electron  diamagnetic  drift  direction 
as  measured  in  the  laboratory  frame  of  reference  This 
indicates  a  fluctuation  phase  velocity  v  h(=w/kQ)  *  2  x  10* 
cm/sec  which  is  in  the  drift  wave  region  of  velocities  ,®  A 
substantial  component  is  also  observed  at  cj/2 n  <  0. 
corresponding  to  the  ion  drift  direction  The  scattering 
geometry  for  this  measurement  is  oriented  according  to  Fig 
2(c),  i  e  k  =  k0  -  k  I f  we  reverse  the  geometry  to 

that  of  Fig.  2(d),  i.e  ks  =  k0  +  k.  one  would  expect  a 
change  in  sign  from  the  results  of  Fig,  3(c),  which  is 
indeed  the  case  as  shown  in  Fig  3(d)  The  features  of 
the  scattered  spectra  are  the  same  except  that  -w  now 
corresponds  to  the  electron  drift  direction 

The  component  of  the  frequency  spectra  corresponding 
to  the  ion  drift  direction  may  result  from  factors  other 
than  a  true  ion  drift  feature  of  the  plasma  On  the  TEXT 
tokamak.  density  fluctuations  in  the  limiter  shadow  and 
scrape-off  regions  have  been  observed  to  propagate  in  the 
ion  drift  direction  due  to  a  strong  radial  electric  field 
inducing  a  plasma  rotation  effect  7  Another  possibility  is 
that  the  interaction  volume  may  extend  to  the  opposite  side 
of  the  plasma  thereby  introducing  components  at  tw  although 
both  represent  the  same  propagation  direction  This  effect 
will  be  described  more  thoroughly  in  the  ensuing  paragraph 

In  Fig  4.  the  frequency  spectra  at  wave  vector  kg  * 
7  cm  *  are  shown  at  three  spatial  positions  along  a 
vertical  chord  through  plasma  center,  scattering  volume  (L 
=  -  14  cm)  positioned  at  the  plasma  top  midplane  and 

bottom  The  tokamak  discharge  conditions  were  1  =  300  kA . 

By  =  20  kG .  and  ne  =  3  *  lo'"  cm-^  At  the  plasma  top  (see 
Fig  4(a)),  the  low  frequency  density  fluctuations  are 
observed  to  be  propagating  largely  in  the  electron 
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Figure  4  Homodyne  spectroscopy  frequency  spectre  for 
scattering  volume  centered  at  plasma  (a)  top,  (b) 
■idplane.  and  (c)  bottom 


456 


Brower  el  >1. 


diamagnetic  drift  direction  with  a  peak  at  u>/2n  *  -300  kHz. 
The  fluctuations  traveling  in  the  ion  drift  direction  are 
typically  at  much  lower  frequency  with  no  clear  peak  except 
at  zero  frequency  Likewise,  at  the  plasma  bottom  (see 
Fig  4(c)).  the  fluctuations  are  again  observed  to  be 
propagating  primarily  in  the  electron  drift  direction  with 
cj/2 n  =  +300  kHz  The  change  in  sign  results  from  the 
reversal  in  direction  of  the  fluctuations  wave  vector 
with  respect  to  the  fixed  wave  vectors  kQ  and  ks  of  the 
incident  and  scattered  radiation  (see  Fig  2).  When  the 
scattering  volume  is  centered  on  the  midplane  (see  Fig. 
4(b)),  fluctuatons  are  detected  both  above  and  below  the 
midplane  resulting  in  peaks  at  -  u/2n .  The  wave 

propagation  direction  can  only  be  resolved  if  the 
scattering  volume  is  situated  completely  above  or  below  the 
midplane.  Similar  observations  have  been  made  at  other 
wavenumbers 


Suamary 

The  wave  propagation  direction  of  microturbulence  in  a 
tokamak  plasma  has  been  accurately  measured  by  application 
of  a  new  homodyne  spectroscopy  technique.  This  method  has 
been  used  in  conjunction  with  a  collective  far-infrared 
laser  scattering  experiment  on  TEXT.  The  low-frequency 
density  fluctuations  are  observed  to  propagate  primarily  in 
the  electron  diamagnetic  drift  direction,  however.  the 
broadband  spectra  also  possess  an  appreciable  level  of 
fluctuations  traveling  in  the  ion  drift  direction 
Application  of  the  homodyne  spectroscopy  technique 
represents  an  inexpensive  and  easily  implemented 
alternative  to  the  more  technically  demanding  heterodyne 
schemes  available  in  the  far-infrared. 
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The  objective  of  this  paper  is  to  describe  the  principles  of  digital  complex  demodulation,  and  to 
summarize  its  advantages  with  respect  to  rapid  time  response  and  insensitivity  to  noise.  These 
advantages  are  demonstrated  by  application  to  interferometry  data  collected  on  the  Texas 
experimental  tokamak  (TEXT). 


INTRODUCTION 

Interferometry  involves  the  measurement  of  the  line-inte¬ 
grated  index  of  refraction  of  a  medium.  In  any  interferome¬ 
ter  a  beam  of  coherent  radiation  is  split,  passed  along  two 
paths  (known  as  the  reference  and  working  arms),  and  re¬ 
mixed  on  a  detector  surface.  Changes  in  the  index  of  refrac¬ 
tion  of  the  medium  along  one  of  these  paths,  the  working 
arm,  alter  the  interference  of  the  remixed  beams,  changing 
the  detector  output.  If  the  variable  medium  is  a  plasma,  mea¬ 
surement  of  the  detected  signal  leads  to  an  estimate  of  the 
line-averaged  electron  density  along  the  working  beam  path. 

In  the  simplest  interferometer,  the  probe  beam  associat¬ 
ed  with  the  working  arm  is  mixed  with  a  reference  beam 
whose  phase  is  fixed.  However,  this  simple  scheme  suffers 
from  a  severe  limitation.  It  is  not  possible  to  distinguish 
between  the  cases  of  increasing  and  decreasing  plasma  den¬ 
sity.  This  shortcoming  may  be  overcome  by  introducing  a 
small  constant  frequency  shift  ca„  between  the  beams  of  the 
working  and  reference  arms.  The  detector  output  becomes  a 
phase-modulated  sine  wave  at  frequency  eo0  with  the  relative 
phase  proportional  to  the  line-integrated  plasma  density. 
The  phase  of  the  detector  output  is  obtained  by  comparison 
of  this  signal  with  a  reference  signal  oscillating  at  the  fre¬ 
quency  e a0.  In  an  electronic  system,  the  relative  phase  of  the 
detector  signal  with  respect  to  the  reference  signal  is  ob¬ 
tained  via  sine  and  cosine  producing  phase  comparators. 
Electronic  systems  are  attractive  for  the  following  reasons: 
the  comparator  output  signals  require  only  simple  process¬ 
ing,  allowing  for  real-time  data  reduction  and  the  computer 
signals  need  only  be  sampled  at  a  low  rate.  There  are  also 
several  disadvantages:  the  comparators  are  sensitive  to  var¬ 
iations  in  the  amplitude  of  the  detector  and  reference  signals; 
the  comparators  may  introduce  large  errors  into  the  phase 
calculation;  and  electronic  systems  have  difficulty  coping 
with  transient  noise.  As  a  result  of  this  noise  sensitivity, 
fringes  are  often  skipped  or  added.  To  counter  the  effects  of 
noise,  the  time  response  of  the  electronic  system  is  length¬ 
ened  (i.e.,  bandwidth  reduced).  Unfortunately,  however, 
this  has  the  undesirable  effect  of  "washing  out"  rapid  fluctu¬ 
ations  in  plasma  density  associated  with  phenomena  such  as 
sawtooth  oscillations 


The  objective  of  this  paper  is  to  describe  the  principles 
and  application  of  digital  complex  demodulation,  which  is 
the  equivalent  of  digital  heterodyning.  The  advantages  of  the 
digital  technique  include  an  insensitivity  to  transient  noise, 
faster  time  response,  and  improved  accuracy  of  the  phase 
measurements. 

In  the  following  sections,  we  describe  the  principles  of 
complex  demodulation,  our  approach  to  avoiding  2rr  phase 
ambiguities,  the  effects  of  noise,  and  an  application  to  inter¬ 
ferometry  data  collected  on  the  Texas  experimental  toka¬ 
mak  (TEXT). 


Digital  complex  demodulation  is  a  digital  version  of 
analog  heterodyne  demodulation  and  allows  the  simulta¬ 
neous  measurement  of  the  amplitude  and  phase  modulations 
of  a  narrow-band  modulated  carrier  signal  as  functions  of 
time.  This  technique  has  been  shown  to  be  effective  and  flexi¬ 
ble  compared  with  other  demodulation  techniques.1  In  the 
following,  we  outline  the  key  ideas  of  digital  complex  de¬ 
modulation. 

A  sampled  narrow-band  signal  x(t„ )  at  a  "carrier  fre¬ 
quency”  cou,  whose  amplitude  and  phase  are  modulated,  may 
be  described  by 

x(t„  5  =  A{t„  )cos[tu„(rn )  +  <?(/■„)]  ,  (1) 

where  A(tn)  and  9(t„ )  are  the  respective  amplitude  and 
phase  modulates  at  the  time  t„  =  n£xt.  Equation  ( 1 )  may  be 
rewritten  in  the  form 

x(tn)  =  l/2A(tn)(exp[iio0t„  4 ■  iO(t„)] 

+  exp{  -  /[&>„/„  +  0(t,  )  ]}) .  (2) 

As  in  analog  demodulation,  we  must  downshift  the  frequen¬ 
cy  by  an  amount  —  a>ty  In  digital  demodulation  this  down¬ 
shifting  is  accomplished  by  simply  multiplying  (in  the  com¬ 
puter  )  the  time  series  data  representing  Eq.  ( 1 )  or  ( 2 )  by  a 
"local  oscillator"  component  2  exp(  —  tu>  „t„ ).  This  results 
in  difference  and  sum  frequency  terms,  ru„  -u„  and 
at„  +  to  o ,  respectively.  By  choosing  to  „  =  the  difference 
frequency  is  set  to  zero. 
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x(t„  )2  exp(  -  iu0t„  ) 

=  A(t„  )exp[/0(r„ )  ]  +  exp{  -  i[lo>ot„  +  8(t„ )]}  • 

(3) 


Also,  as  in  analog  demodulation,  a  low-pass  filter  is  applied 
to  the  signal.  In  this  case  a  digital  low-pass  filter  is  applied 
whose  passband  cjf  is  chosen  to  block  the  sum  frequency 
component,  ojf  <oj0  +  0,  but  to  admit  the  inherent  band¬ 

width  of  the  carrier  due  to  phase  modulation,  coF  >  | dd(t)/ 
dt  |m„ .  A  linear  digital  low-pass  filter,  MAXFLAT,2  is  se¬ 
lected  for  this  purpose.  This  filter  has  a  variable  passband 
and  a  variable  transition  region  in  which  the  filter  response 
drops  from  0.95  to  0.05.  The  amplitude  modulation  A(t„  ) 
and  phase  modulation  8(t„ )  are  easily  recovered  from  the 
digital  filter  output,  y(r„  )  =  A(t„  )exp[/i9(/„ )), 

<4(0  =  LKOl  .  (4) 


0{t,)  =  arctanf *  M  +  2.Vir  . 

\  Re[  v(r„ )  ]  / 


where  N  is  any  integer. 


(5) 


II.  PHASE  AMBIGUITIES  AND  NOISE 

As  a  result  of  the  multivalued  nature  of  the  arctangent 
function,  there  is  a  2 rr  ambiguity  in  the  phase  demodulation; 
we  make  two  assumptions:  ( 1 )  The  original  phase  demodu¬ 
late  8(i)  before  sampling  is  continuous.  This  assumption  is 
reasonable  if  A  (t)  never  crosses  zero,  since  the  bandlimited- 
ness  of  the  modulated  signal  x(t)  ensures  the  continuity  of 
0(r).  (2)  The  change  in  the  calculated  phase  modulate 
between  any  two  consecutive  samples  is  less  than  rr.  Thus,  if 
one  obtains  a  true,  unambiguous  value  of  8(t„ ),  this  condi¬ 
tion  guarantees  an  unambiguous  value  of  8(tn  +  ,  )  by  re¬ 
stricting  the  range  of  possible  values  in  the  interval 

0(0  -  rr  <0(r„  +  |  )  <9(t„)  +  rr  . 

The  length  of  this  interval  is  2  rr. 

We  now  show  that  the  proper  choice  of  the  sampling 
rate  ensures  that  the  phase  shift  between  two  samples  is  less 
than  rr.  The  function  x(t„  )  is  a  product  of  A(t„)  and 
exp{/[«y0/n  +0{t„)]}.  According  to  the  convolution 
theorem,  the  bandwidth  B,  of  x(rn)  is  greater  than  the 
bandwidth  of  either  A(t„  )  or  exp{/|£U()/„  +  8(tn  )  ] }.  Thus, 
we  have  |A0(r„  )/2 rr&t  |m.,  <  Bx  This  inequality  may  be 
written  in  terms  of  the  Nyquist  frequency  f  s  =  (2A/)"\as 
|  A0(/„  ):mj,  <  n{BK/  f  v  )  If  the  sampling  frequency  is  suf¬ 
ficiently  high  to  satisfy  the  sampling  theorem  (Bx  </>  ), 
then  f  A 8(t„  )  |mj,  <  rr,  and  the  problem  of  2rr  phase  ambigu¬ 
ities  is  removed. 

A  time  series  to  be  demodulated  may  be  contaminated 
by  noise  which  includes  the  quantization  noise  at  the  A/D 
conversion  stage  as  well  as  electronic  pickup  and  plasma 
noise.  The  multiplication  of  x(r„  )  by  exp(  -  io>  ,,r„  )  trans¬ 
lates  the  frequency  contents  of  x(r,  )  by  -  c,>  „  in  the  fre¬ 
quency  domain,  but  has  no  effect  on  thesignal-to-noise  ratio 
However,  low-pass  filtering  increases  the  signal-to-noise  ra¬ 
tio  when  the  signal  A(i„  )exp[/0(r„  )]  is  within  the  pass- 
band.  If  the  noise  is  assumed  to  be  white  up  to  the  Nyquist 
frequency  /v  and  the  low  -pass  cutoff  frequency  is /,.  then 


the  gain  of  the  signal-to-noise  ratio  is  /  N  /  f  F .  In  view  of  the 
fact  that  the  low-pass  filtering  is  being  done  digitally,  it  is  a 
relatively  simple  matter  to  adjust  the  low-pass  cutoff  fre¬ 
quency  to  enhance  the  signal-to-noise  ratio 

The  error  in  the  calculated  phase  modulate  6(tn )  de¬ 
pends  on  the  signal-to-noise  ratio  SNR(r„  )  of  the  filtered 
signal, y(  t„  ) .  To  avoid  errors  in  the  calculated  phase  due  to  a 
momentary  low  signal-to-noise  ratio,  a  threshold  value  of 
y(tn  )yml„ ,  is  defined  which  depends  on  the  amplitude  of  the 
noise.  When  the  signal  amplitude  drops  so  that  [y(/„)| 
<ymm,  the  calculated  phase  modulate  8(t„  )  is  ignored  and 
the  previously  calculated  phase  modulate  is  carried  over.  If 
the  noise  is  a  transient,  it  is  reasonable  to  neglect  the  phase 
measurement  and  assume  the  previous  value  of  the  phase  as 
the  current  value. 

In  most  interferometers  the  shift  frequency  a>0  (referred 
to  as  the  carrier  frequency  in  this  section)  is  generally  not 
known  with  perfect  precision  and  may  also  be  unstable.  The 
application  of  a  constant  digital  local  oscillator  frequency 
shift,  col0,  which  is  not  precisely  equal  to  a>0  will  therefore 
result  in  an  accumulated  phase  error  which  increases  linear¬ 
ly  in  time,  (<u0  —  o>l0  )/„ .  Other  sources  of  phase  error  in¬ 
clude  phase  distortions  introduced  by  the  low-pass  filter  and 
deviations  from  a  uniform  sampling  rate.  To  cancel  these 
errors  a  reference  signal  xr(t),  which  equals  the  carrier  sig¬ 
nal  without  the  plasma  phase  modulation  8(tn),  is  also  sam¬ 
pled, 

*,(0  =A,(tJexpUcoot„)  . 

Digital  complex  demodulation  is  then  applied  to  this  signal 
as  well,  to  yield  a  reference  phase  modulate  8r(t„).  The  de¬ 
sired  phase  modulate  6p(tn),  due  to  the  plasma  is  simply  the 
difference  of  these  calculated  phases, 

0,(0  =  0(0  -0,(0  • 

III.  APPLICATION  TO  INTERFEROMETER  DATA 

We  have  applied  digital  complex  demodulation  to  dat  . 
from  a  1.2-mm  far-infrared  laser  interferometer  on  TEXT. 
The  detector  and  reference  signals  had  carrier  frequencies  of 
80  kHz.  These  signals  were  sampled  at  200  kHz  for  325  ms. 
The  digital  low-pass  filter,  MAXFLAT,  was  set  to  a  band¬ 
pass  frequency  of  20%  of  the  carrier,  resulting  in  200  filter 
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Fig  2  Comparison  of  sawtooth  oscillations  observed  b>  electronic  demodu¬ 
lation  and  digital  complex  demodulation 

coefficients  which  characterize  the  filter  impulse  response. 
The  amplitude  threshold_ymin  for  phase  interpolation  was  set 
to  5%  of  the  maximum  signal  level  when  free  of  noise.  Figure 
1  shows  the  calculated  density  for  a  typical  shot.  Note  that 
the  digital  complex  demodulation  result  faithfully  tracks  the 
density  including  the  dip  during  the  current  rise  at  ~  25  ms. 

Figure  2  compares  data  reduced  via  complex  demodula¬ 
tion  with  data  from  a  standard  interferometer  utilizing  elec¬ 
tronic  phase  comparators.  Both  data  are  from  a  portion  of  a 
discharge  during  which  “saw  tooth"  activity  was  occurring. 
The  digitally  produced  traces  display  the  sawtooth  shape 
(generally  observed  via  other  faster  diagnostics)  while  the 
electronically  reduced  data  resemble  a  sine  wave,  due  to  the 
smoothing  resulting  from  a  derated  time  response,  the  latter 
being  necessary  to  avoid  skipped  fringes  due  to  noise  effects. 


Note  also  that  the  time  response  of  the  complex  demodula¬ 
tion  approach  is  sufficiently  fast  to  recover  the  Mirnov  oscil¬ 
lations  which  are  superimposed  on  the  sawteeth  oscillations. 

These  examples  demonstrate  that  digital  complex  de¬ 
modulation  possesses  a  number  of  advantages  with  respect 
to  insensitivity  to  noise,  faster  time  response,  and  phase  mea¬ 
surement  accuracy.  Also,  the  digital  approach  is  inherently 
flexible,  allowing  relatively  easy  adjustment  of  the  low -pass 
filter  cutoff  frequency  and  the  amplitude  threshold  The 
limitations  of  digital  complex  demodulation  stem  from  the 
memory  and  computation  requirements  We  typically  sam¬ 
ple  the  80-kllz  detector  and  reference  signals  at  200  kHz  for 
325  ms,  yielding  130  000  samples  In  addition,  a  16-kHz  di¬ 
gital  low-pass  filter  (20fiE )  contains  200  coefficients  repre¬ 
senting  the  filter  impulse  response  The  resulting  convolu¬ 
tion  integrals  require  13  min  on  a  VAX  1  1/780  with  a 
nonoptimized  code  The  use  of  an  array  processor  would 
probably  reduce  this  time  requirement  considerably 
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A  new  homodyne  spectroscopy  technique  (T.  Tsukishima  and  O.  Asada,  Jpn.  J.  App!  Phys  17, 

2059  (  1978)  ]  has  been  applied  to  tokamak  microturbulence  measurements  in  order  to  resolve  the 
frequency  spectra  and  w  ave  propagation  direction  of  low-frequency  density  fluctuations 
Application  of  (his  method  provides  a  high-resolution,  inexpensive,  and  easily  implemented 
alternative  to  the  more  technically  demanding  heterodyne  detection  schemes  typically  available 
Comparison  of  heterodyne  and  the  new  homodyne  spectroscopy  results  will  be  made 
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Laser  and  millimeter-wave  scattering  techniques  are  com¬ 
monly  used  to  study  the  space-time  statistics  of  electron  den¬ 
sity  fluctuations  in  tokamak  and  other  plasmas.  Of  particu¬ 
lar  importance  is  the  determination  of  the  direction  of 
propagation  of  the  fluctuations  Since  the  scattering  geome¬ 
try  fixes  the  direction  of  the  scattering  wave  vector  k,  the 
direction  of  propagation  information  is  carried  by  the  sign  of 
the  fluctuation  frequency  u>.  The  fact  that  waves  may  be 
propagating  both  parallel  and  antiparallel  to  k  is  manifested 
by  the  presence  of  blue  and  red  sidebands  centered  around 
the  incident  wave  frequency  (o„. 

To  recover  the  propagation  direction  information  con¬ 
tained  in  the  blue  and  red  sidebands,  heterodyne  detection 
techniques  are  typically  employed.  This  approach  requires 
two  coherent  sources,  with  a  frequency  difference  ho,  to  be 
utilized  as  the  incident  and  local  oscillator  beams  After  mix¬ 
ing,  the  frequency  range  of  the  resultant  signal  is  ho  ^  to, 
where  ca  is  the  frequency  associated  with  the  plasma  fluctu¬ 
ations.  As  long  as  Aru><y,  the  blue/red  sidebands  may  be 
resolved.  In  contrast,  if  a  classical  homodyne  approach  is 
used,  2sa>  =  0,  and  it  is  no  longer  possible  to  unambiguous!, 
determine  the  wave  propagation  direction 

Realization  of  a  heterodyne  detection  system  m  the  tar 
imrared  is  expensive  (e  g  ,  two  lasers)  and  technujllv  h 
trivial  (e  g.,  intermediate  frequenc  v  ( II  i  stabilitv  Im- 
mentation  ofa  rotating  grating  to  fiequencv  'Inn  t  ;> 
the  source  beam  is  a  feasible  alternative  .do  .e‘  " 
quency  offset  is  limited  to  roughlv  Ai  ;  *•)  »i! 
cient  for  microturbulence  measurement-,  w  ■■  ■> 
are  observed  up  to  1  MHz  ;  A  . .  ■nsidc  o-  ,  , 
expensive  methmf  proposed  bv  Imi)  <  ■  \ 

mils  resolution  of  the  wave  pr  -paca'i 
analysis  of  homody  ne  signals  I  ho  r u  ■»  - 
copy  technique  utilizes  udv  .ns  :  •  , 
while  requiring  minor  moOiti.  0,  ■ 
tern 
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Ti.e  IF  output  of  the  scattered  signa  al'r 
real  quantity  described  by 

t'(/)  =  Re^  (  ----  A  s  . 

where  k  .  =  A,  k  is  the  wave  u-. . 
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fluctuation  level  The  sign  i  t v.  •• 
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Fig.  1.  Experimental  arrangement  for  homodyne  spectroscopy  measure¬ 
ment. 


transform  algorithm,  the  frequency  spectrum  was  obtained 
with  a  resolution  of  tuo/lit  —  1/T'~8  kHz.  Mixer  and  am¬ 
plifier  noise  contributions  could  be  subtracted  from  the  auto¬ 
power  spectra  although  signal-to-noise  levels  were  suffi¬ 
ciently  large  so  as  to  make  it  unnecessary. 

A  schematic  of  the  experimental  arrangement  employed 
for  application  of  the  new  homodyne  spectroscopy  tech¬ 
nique  to  collective  far-infrared  scattering  is  shown  in  Fig.  1. 
Detailed  information  on  the  scattering  system  is  described 
by  Park  et  air 

At  a  particular  wave  vector  kl ,  the  scattered  radiation 
beam  is  divided  equally  into  two  components  which  are  cou¬ 
pled  into  the  detectors  by  50%  reflectivity  beam  splitters. 
Similarly,  the  local  oscillator  beam  is  equally  divided  to  pro¬ 
vide  rf  drive  for  each  mixer.  In  one  leg  of  the  local  oscillator 
beam  a  phase  shifter  is  inserted.  This  phase  shifter  consists  of 
a  piece  of  high-density  polyethylene  (excellent  transmission 
characteristics  at  245  GHz)  mounted  on  a  rotation  stage.  By 
tuning  the  rotation  angle,  the  path  of  the  LO  beam  through 
the  polyethylene  is  altered,  thereby  changing  its  phase.  This 
phase  shifter  is  tuned  such  that  there  is  a  90°  phase  difference 
between  the  detected  signals  in  the  two  channels.  The  signal 
from  each  detector  is  then  amplified  and  digitized  so  that  the 
cross-  and  auto-power  spectra  may  be  computed. 

II.  EXPERIMENTAL  RESULTS 

The  new  homodyne  spectroscopy  technique  will  now  be 
applied  to  density  fluctuation  measurements  in  a  high-tem¬ 
perature  tokamak  plasma.  Microturbulence  (low-frequency 


density  fluctuations)  is  driven  by  the  free  energy  associated 
with  plasma  inhomogeneities  such  as  density  and  tempera¬ 
ture  gradients.  For  drift-wave-type  fluctuations,  it  is  predict¬ 
ed  that  the  turbulence  will  exhibit  a  phase  velocity  of  order 
Vo,  =<u/ke  =  (kaT,/eBrnr  )Vnt  XBr/|£,  |,  where  v,^  is 
the  electron  diamagnetic  drift  velocity,  ke  and  oj  are  the 
poloidal  wave  vector  and  frequency  of  the  fluctuation,  BT  is 
the  toroidal  magnetic  field,  Tt  is  the  electron  temperature, 
and  nt  is  the  electron  density.  The  scattering  system  is  posi¬ 
tioned  such  that  the  incident  beam  impinges  upon  the  plas¬ 
ma  from  the  top  of  the  torus  (see  Fig.  1 )  along  a  vertical 
chord  at  the  major  radius  R  =  \m.  This  provides  for  scatter¬ 
ing  from  fluctuations  with  a  poloidal  wave  vector,  ke .  De¬ 
pending  upon  the  orientation  of  the  collection  optics  with 
respect  to  the  incident  beam,  the  wave-number  matching 
condition  (momentum  conservation)  gives  k,  =ko±k, 
where  k5 ,  k^,  and  k  are  the  wave  vectors  of  the  scattered 
beam,  incident  beam,  and  plasma  fluctuation,  respectively. 
Similarly,  energy  conservation  dictates  eos  =co0±  co,  where 
the  subscripts  have  the  same  meaning  as  above.  In  addition, 
it  is  important  to  note  that  for  a  specific  scattering  geometry, 
a  sign  change  (  ±  k)  will  occur  in  the  detected  signal  when 
one  switches  from  the  plasma  top  to  bottom  for  fluctuations 
traveling  in  a  particular  poloidal  direction. 

Experimental  results  from  the  Texas  Experimental  To¬ 
kamak  (TEXT)  for  a  scattering  volume  located  at  the  plas¬ 
ma  bottom  with  poloidal  wave  vector  kB  =7  cm-1  are 
shown  in  Fig.  2.  The  discharge  parameters  were  lp  =  400 
kA,^r  =  28  kG,  and  n,  —  2x  10,Jcm~J.  In  Fig.  2(a),  the 
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Fig  2.  Application  of  homodyne  spectroscopy  technique  to  tokamak  low- 
frequency  microiurbulence  data;  (a)  k.  =  kn  —  k.  and  (b)  k,  =  k,  -t-  k. 
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Fig.  3.  Heterodyne  detection  frequency  spectra. 


fluctuations  are  observed  to  possess  a  clear  peak  at  +  to/ 
2jt~275  ±  50  kHz  in  the  electron  diamagnetic  drift  direc¬ 
tion  as  measured  in  the  laboratory  frame  of  reference.  This 
indicates  a  fluctuation  phase  velocity  ▼,*  (  =  o»/ 
kB  )~2  x  10s  cm/s,  which  is  in  the  drift  wave  region  of  ve¬ 
locities.3  A  substantial  component  is  also  observed  at  to/ 
2ir  <  0,  corresponding  to  the  ion  drift  direction.  If  we  reverse 
the  geometry  from  that  of  Fig.  1,  one  would  expect  to  see  a 
change  in  sign,  which  is  indeed  the  case  as  shown  in  Fig. 
2(b).  The  features  of  the  scattered  spectra  are  the  same  ex¬ 
cept  that  —  to  now  corresponds  to  the  electron  drift  direc¬ 
tion. 

The  component  of  the  frequency  spectra  corresponding 
to  the  ion  drift  direction  may  result  from  factors  other  than  a 
true  ion  drift  feature  of  the  plasma.  On  the  TEXT  tokamak, 
density  fluctuations  in  the  limiter  shadow  and  scrape-off  re¬ 
gions  have  been  observed  to  propagate  in  the  ion  drift  direc¬ 
tion  because  of  a  strong  radial  electric  field  inducing  a  plas¬ 
ma  rotation  effect. 

Heterodyne  scattering  results  for  k&  =  7  cm-'  under 
plasma  conditions  of  Ip  =  300  kA,  Br  =  26  kG,  and 
nt  =  2x  10' '  cm-’  are  shown  in  Fig.  3  for  scattering  vol¬ 
ume  positioned  at  the  plasma  bottom.  The  frequency  differ¬ 
ence  between  the  local  oscillator  and  incident  beams  (IF)  is 
A<u/2ir~  1 100  kHz.  By  subtracting  the  IF  frequency  contri¬ 


bution  from  the  scattered  spectra  and  defining  it  as  the  new 
zero,  one  can  plot  the  fluctuation  spectrum.  The  resolution 
near  zero  frequency  is  limited  by  the  IF  bandwidth  which  is 
±  20  kHz.  As  with  the  homodyne  spectroscopy  technique, 
moving  the  scattering  volume  from  plasma  top  to  bottom 
results  in  a  change  of  sign  for  the  fluctuation  spectrum.  In 
comparing  the  results  of  Figs.  2  and  3,  it  is  very  evident  that 
both  the  new  homodyne  spectroscopy  and  heterodyne  detec¬ 
tion  methods  produce  similar  spectra.  Any  differences  can 
be  attributed  to  plasma  discharge  conditions.  The  homodyne 
spectroscopy  method  provides  improved  resolution  near 
zero  frequency  (  ±  8  kHz). 

III.  SUMMARY 

The  wave  propagation  direction  of  microturbulence  in  a 
tokamak  plasma  is  measured  by  application  of  a  new  homo¬ 
dyne  spectroscopy  technique.  The  accuracy  of  this  method  is 
established  by  comparison  with  results  from  a  heterodyne 
scattering  system  which  are  similar.  Both  techniques  show 
the  low-frequency  density  fluctuations  to  be  propagating 
primarily  in  the  electron  diamagnetic  drift  direction:  how¬ 
ever,  the  broadband  spectra  also  possess  an  appreciable  level 
of  fluctuations  traveling  in  the  ion  drift  direction.  Applica¬ 
tion  of  the  homodyne  spectroscopy  technique  represents  an 
inexpensive  and  easily  implemented  alternative  to  the  more 
technically  demanding  heterodyne  schemes  available  in  the 
far  infrared. 
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ABSTRACT 

A  new  digital  aethod  of  estimating  linear  and 
quadratic  transfer  functions  of  a  quadratic  system 
with  a  general  random  Input  is  presented.  The 
feasibility  of  the  technique  Is  demonstrated  by 
analyzing  simulated  data.  It  Is  also  shown  that 


considerable 

error 

occurs 

in 

estimating 

the 

transfer  functions 

based  on 

a 

Gaussian 

Input 

assumption, 

Gaussian. 

when 

In  fact 

the 

Input  Is 

non- 

1. 

INTRODUCTION 

A  difficulty  encountered  when  one  attempts  to 
apply  the  Volterra  functional  series  to  nonlinear 
problems  Is  the  measurement  of  the  ''olterra  ker¬ 
nels  [see,  e.g.  1].  So  far,  a  fundamental  assump¬ 
tion  underlying  many  approaches  Involves  the  fact 
that  the  "Input"  la  assumed  to  be  a  stationary 
random  process  which  possesses  Gaussian  statis¬ 
tics,  an  assumption  which  allows  a  substantial 
simplification  of  the  relevant  mathematics.  In 
many  practical  cases,  however,  the  input  excita¬ 
tion  Is  not  under  the  direct  control  of  the  exper¬ 
imentalist  which  precludes  the  use  of  the  so- 
called  "probing"  method,  thus  one  must  use  the 
measured  input  data  as  they  are  observed. 

It  has  been  shown  in  [2]  that,  for  a  zero- 
mean  Gaussian  input,  expressions  for  the  linear 
and  quadratic  transfer  functions  are  respectively 
given  in  terms  of  various  spectral  moments  up  to 
third  order  (l.e.,  the  blspectrum).  However,  when 
a  general  random  Input  Is  applied  to  the  system, 
It  is  extremely  difficult  to  find  such  closed  form 
expressions  for  the  transfer  functions.  Katzenel- 
son  and  Could  [3]  described  an  Iterative  method  to 
solve  this  problem,  and  Eykhofr  [A]  considered  a 
discrete  time  version.  Also  Rltz  and  Powers  [5] 
showed  that  when  the  Input  Is  weakly  non-Gausalan, 
transfer  functions  of  a  quadratic  system  can  be 
estimated  by  an  Iterative  approach  In  the  discrete 
frequency  domain. 

It  is  the  purpose  or  this  paper  to  describe  a 
new  digital  method  of  processing  Input  and  output 
signals  in  order  to  quantitatively  measure  the 
linear  and  the  quadratic  transfer  functions  even 
when  ws  cannot  assume  a  particular  characteristic 
of  the  Input.  In  the  next  section,  it  la  shown 
that  the  linear  and  the  quadratic  transfer  func¬ 


tions  can  be  evaluated  in  the  discrete  frequency 
domain  by  solving  matrix  equations.  In  section  3, 
a  known  system  Is  analyzed  to  show  the  feasibility 
of  the  analysis  results.  In  addition,  the  results 
are  compared  with  the  transfer  functions  estimated 
by  the  "Gaussian  input  method"  In  order  to  Illus¬ 
trate  the  deleterious  effects  of  assuming  a  Gauss¬ 
ian  input  when  In  reality  it  is  not. 

2.  ESTIMATION  OF  TRANSFER  FUNCTIONS 

Since  we  will  concentrate  on  frequency  domain 
analysis  and  the  objective  Is  to  find  a  digital 
method  that  can  be  practically  implemented,  we 
will  start  from  the  Input-output  relationship  in 
the  discrete  frequency  domain.  In  the  following, 
we  assume  the  unknown  nonlinear  system  Is  of 
second  order  (l.e.,  quadratic),  thus,  higher  order 
terms  may  be  sarely  neglected.  Then  the  model  to 
be  studied  can  be  expressed  as  follows; 

X(fm)  “  Hi(fm>x(fm)  *  1  1  H2(fk,ri)X,fk)X(fl>  (1) 

fc*l-m 

where  X(f)  and  7(f)  respectively  represent  the 
discrete  Fourier  transforms  (DFT's)  for  a  finite 
number  (N)  of  observations  of  the  Input  and  the 
output  signals  of  the  nonlinear  system  described 
by  Volterra  series  up  to  second  order.  On  the 
other  hand,  H.(fB)  and  *Mfk,fl^  are  llnear 
quadratic  transfer  functions  which  are  given  by 
the  Fourier  transforms  of  Volterra  kernels  at  a 
discrete  set  of  frequencies 

|fn-n/N;  n--(N-1)/2 . -1,0,1 . N/2) .  It  will 

be  assumed  thst  the  quadratic  transfer  function 
H.(f  ,f.)  is  a  symmetric  function  of  Its  argu¬ 
ments,  i.e.,  H2<fk*ri)“H2(ri,rk)' 

Determination  of  the  linear  and  quadratic 
transfer  functions  in  terms  of  the  Input  and  out¬ 
put  characteristics  can  be  carried  out  by  solving 
the  following  set  or  equ|tlons  which  |re  obtained 
by  multiplying  (1)  by  X  (fB)  and  X  (f.)X  (f  ), 
respectively,  and  then  taking  an  expectedvalueJof 
each  side. 

EtxV.mrj]  -  H1(fB)E[|x(fB)|2] 

♦  I  £  H^.f^EtxV^Wr^XUj))  (2) 
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Note  that 


eanlngful  only 


fB-fk*f,«f , *f ,  because  of  the  properties  of  higher 
order  spectral  moments  £6]. 

When  the  system  input  Is  zero-mean  Gaussian, 
the  terms  containing  the  third  order  moment  of  the 
Input  in  (2)  and  (3)  vanish.  In  this  case,  the 
linear  and  the  quadratic  transfer  functions  can  be 
determined  separately  and  expressed  by  the  various 
spectra  up  to  third  order  £2],  Hors  specifically, 
they  are  given  by 
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E£X  (fB)Y(fB)] 
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H2(fk*fl>  * 


EExV^xV^w^fj)] 
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k*l«0. 


However,  for  the  case  of  a  general  input,  we  have 
to  solve  (2)  and  (3)  simultaneously  so  that  It  la 
extremely  difficult  to  find  the  closed  form  solu¬ 
tions  like  (k)  and  (5). 

Next,  we  will  describe  a  method  of  solving 
(2)  and  (3)  which  can  be  digitally  Implemented. 
Due  to  the  symmetrlclty  assumed  for  H  (f  ,f  ),  we 
can  express  the  output  only  In  tend  or  the  por¬ 
tions  of  the  quadratic  transfer  function  which  are 
In  the  sum  and  difference  Interaction  regions  of 
the  two  dimensional  frequency  domain  (regions  S 
and  D  In  Fig.  1).  Using  this  fact  and  expanding 
the  summation  term  In  (1),  we  can  rewrite  (1)  In 
the  following  vector  formi 

Y(fB)  -  fj^X  -  XfcH  (6) 
where  t  denotes  transposition,  and 


Fig.  1  Two  .dimensional  frequency 
domain. 


H  -  (E[X*Xt]»‘,E[X*Y(fi|)]  (8) 

If  E[nl]  is  not  singular.  The  solution  given  by 
(8)  can  also  be  considered  as  a  result  of  mul¬ 
tivariate  linear  regression  analysis  and  thus  the 
transfer  functions  obtained  In  this  manner  are 
optimum  In  the  mean  square  sense  when  there  la  any 
additive  noise  present  In  the  output.  Note  that 
EfJC  is  a  Hermit  lan  matrix  consisting  of  vari¬ 
ous'  spectral  moments  of  the  Input  signal.  Except 
for  the  rirst  element,  the  rirst  row  and  column  of 
this  matrix  represent  the  bispectrum  idille  the 
first  element  Is  the  auto  power  spectrum  of  the 
Input.  The  remaining  elements  are  fourth  order 
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In  (6),  fB  signifies  the  Nyqulst  frequency  associ¬ 
ated  with  the  sampling  of  the  Input  and  output 
signals.  Then  solving  simultaneously  (2)  and  (3) 
Is  equivalent  to  solving  the  following  matrix 
equation; 

Erx*Y(fB)]  .  E£xV]H.  (7) 

Equation  (7)  Is  linear  In  the  transfer  function 
vector  H,  and  so  H  Is  given  by 


spectral  moments  of  the  Input.  The  sice  of  the 
matrix  E£JC  3Cl]  to  be  inverted  depends  on  the 
number  of  data  points  (N-2M)  taken  for  the  DFT's 
and  the  frequency  Index  m.  For  example.  It  Is  a 
(H»2-m/2)*(M*2-m/2)  matrix  If  m  Is  even,  and  It  Is 
(K*2-=ji)*(H*2-2— L)  when  n  is  odd.  Therefore  the 
size  decreases  aft  m  Increases.  The  largest  one  Is 
(H»2)»(M*2)  when  m-0,  and  the  smallest  one  Is 
(|*2)«( |*2)  when  m-M-1  or  M. 


3 


If  the  Input  is  zero-mgap  Gaussian,  It  can  be 
shown  that  the  matrix  E  [£  X1 ]  becomes  a  diagonal 
matrix  and  thus  the  transfer-  functions  given  by 
(8)  have  the  same  expressions  for  the  linear  and 
quadratic  transfer  functions  as  those  given  by  (4) 
and  (5).  Consequently  the  solution  given  by  (8) 
Is  a  general  one  for  the  transfer  functions  of  a 
quadratically  nonlinear  system  with  an  arbitrary 
random  Input,  i.e..  It  Includes  the  zero' mean 
Gaussian  input  as  a  special  case. 

3.  ANALYSIS  OF  SIMULATED  DATA 

FORTRAN  programs  have  been  written  on  the 
basis  of  the  analysis  results  described  In  the 
previous  section  and  tested  by  analyzing  simulated 
data  generated  by  a  known  system. 

The  known  system  is  formed  as  follows; 
y(t)  -  -0.64x(t)  ♦  x( t-2)  ♦  0.9x2(t)  ♦  x2(t-l).(9) 
Therefore  the  transfer  functions  are  given  by 

H1  (f )  -  -0.64  ♦  e'll,wf  (10) 

-12x(f  «f  ) 

H2(f1 ,f2)  -  0.9  ♦  e  .  (11) 

These  actual  transfer  functions  of  the  given  sys¬ 
tem  are  shown  in  Fig.  2  and  Fig.  3. 

The  approach  developed  In  this  paper  has  been 
tested  by  applying  a  zero-mean  exponentially  dis¬ 
tributed  input  signal  which  Is  generated  by  IMSL 
-outlne.  For  both  the  Input  and  output  signals, 
128000  sample  points  or  data  record  have  been  gen¬ 
erated,  and  they  were  divided  up  Into  2000  seg¬ 
ments  of  64  data  points  each. 


two  methods.  In  this  case,  the  mean  square  errors 
have  been  calculated  as  follows;  by  the  Gaussian 
Input  method,  MSE.-9.5,  MSE  -4.0,  and  by  the  gen¬ 
eral  input  method,  MSF.j-o. 06,  USE  -0.04.  Compar¬ 
ing  the  mean  square  error  valuSs  as  well  as  the 
plots,  one  can  clearly  see  the  obvious  differences 
between  the  transfer  functions  estimated  by  the 
two  methods.  This  Indicates  that.  In  order  to 
obtain  useful  estimates,  one  must  use  the  new 
method  developed  in  this  paper  when  there  is  not 
any  good  a  priori  knowledge  about  the  Input  signal 
statistics  or  when  the  input  Is  not  a  zero-mean 
Gaussian  signal. 

4.  CONCLUSION 

In  this  paper,  we  have  discussed  the  problem 
of  estimating  system  transfer  functions  by  pro¬ 
cessing  random  Input  and  output  data,  and 
described  a  new  digital  estimation  method  which 
can  be  successfully  applied  to  the  modeling  of  a 
quadratically  nonlinear  system  excited  by  a  non- 
Gausslan  Input.  Analyzing  simulated  data,  we  have 
also  demonstrated  the  feasibility  of  the  tech¬ 
nique.  Considering  the  computation  time  and  the 
size  of  the  processor  memory  available  for  practi¬ 
cal  applications.  It  Is  desirable  that  the  number 
of  data  points  In  each  segment  of  record  data  be 
not  too  large  since  It  determines  the  dimensions 
of  the  matrices  that  must  be  Inverted  In  order  to 
solve  the  matrix  equations. 
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For  a  quantitative  measure  of  the  quality  of 
the  estimates,  the  normalized  mean  square  errors 
Involved  In  the  computed  transfer  functions  are 
defined  as  follows; 
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where  *  signifies  an  estimated  quantity,  and  N  Is 
the  number  of  frequency  points  In  the  regions  S 
and  D. 

By  using  the  input  and  output  signals  gen¬ 
erated  above,  the  estimation  of  the  linear  and 
quadratic  transfer  functions  have  been  carried  out 
first  by  the  Gaussian  Input  method  (Eqa.  (4)  and 
(8)),  and  then  the  results  are  compared  with  the 
estimates  that  have  been  obtained  by  the  general 
input  method  developed  In  this  paper  (Eq.  (6)). 
Fig.  4  and  Fig.  5,  respectively,  show  the  linear 
and  quadratic  transfer  functions  estimated  by  the 
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Abstract— A  method  of  measuring  transfer  functions  for  nonlinear 
dyadic-invariant  (NLDI)  systems  described  by  Barrett's  orthogonal  model 
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I.  Introduction 

Ever  since  J.  L.  Walsh  published  a  complete  set  of  orthogonal 
functions  |l|.  Walsh  functions  have  been  one  of  the  most  important 
examples  of  nonsinusoidal  functions  in  engineering  applications.  The 
computational  efficiency  of  the  fast  Walsh  transform  (FWT)  and  the 
feasibility  of  its  software  and  hardware  implementation  provide  the 
incentive  to  find  useful  applications.  On  the  other  hand,  motivated  by 
the  fact  that  the  Walsh  functions  form  the  natural  basis  for 
representing  dyadic-invariant  systems  just  as  trigonometric  functions 
do  for  time-invariant  systems,  some  studies  have  been  carried  out  to 
find  useful  properties  of  the  dyadic-invariant  systems.  For  example, 
an  optimal  linear  dyadic-invariant  (LDI)  system  was  developed  |2|. 
the  problem  of  modeling  a  multiple  input/output  LDI  system  in  terms 
of  sequency  transfer  functions  was  considered  |3|.  and  Walsh  series 
expansions  were  utilized  to  develop  a  method  of  measuring  the 
kernels  up  to  order  two  in  Wiener's  nonlinear  system  model  |4|. 

In  this  study,  we  present  a  method  of  measuring  kernels  for  the 
identification  of  a  class  of  nonlinear  systems  which  can  be  described 
by  Barrett's  model.  Since  we  let  the  model  have  dyadic  convolution 
operations,  we  refer  to  these  systems  as  nonlinear  dyadic-invariant 
(NLDI)  systems.  In  particular,  we  will  obtain  the  expressions  for 
kernels  up  to  third  order  using  Walsh  transform  techniques.  Thus  this 
study  can  be  considered  as  an  extension  (through  a  different 
approach)  of  the  results  in  |4|.  and  also  a  parallel  study  to  that  which 
has  been  carried  out  by  Hong  el  at.  in  the  Fourier  frequency  domain 
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By  orthogonal  izing  (he  Volterra  functional  series  for  a  white 
Gaussian  input  signal.  Wiener  provided  a  representation  for  nonlinear 
systems  |6|.  In  many  practical  situations,  however,  the  input  to  the 
system  is  usually  nonwhitc.  In  such  cases,  Barrett's  orthogonal  model 
|7)  for  nonlinear  systems  is  a  useful  one  since  it  is  valid  for  Gaussian 
inputs  with  arbitrary  spectral  densities. 
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II  Development 

Now  we  consider  the  space  of  real  nonncgative-coordinate  signals. 
Thus  the  input  x\t)  is  defined  for  /  >  0.  Following  Barrett's 
orthogonal  representation,  we  can  express  the  output  y(t)  of  an 
unknown  NLDI  system  for  a  zero-mean  dyadic  stationary  |3] .  |8) 
Gaussian  input  process  x(t)  as  follows: 

|*® 

y(l)  =  h 0+  6i(f,)xr(f  ®  fi)  dt, 

Jo 

(oo  poo 

h2(t i,  f:){x(f  ®  /,)jr(r  ®  /■>) 

o  *'n 

-£(*(/  ®  t,)x(t  ®  r2)|}  dt ,  dt2 

J®  I1®  j*n 

*j(/i.  h.  ®  fi)x(f ®  t2)x(t ©  f,) 

0  *'o  Jo 

-X((®  /,)£■(  jr(z  ©  t2)x{t  ©  /,)) 

-x(7®  r2)£|.r(/©  h)x(tm  /,)] 

-x(f®  fj)£|x(/®  /,) x(/m  f2)J}  dt i  dt2  di}+ ■■■  (1) 

where  h„  is  the  nth  order  kernel.  £[ ■  |  denotes  the  expectation 
operator,  and  the  operator  ®  signifies  modulo-two  addition  (without 
carry)  of  the  two  real  numbers  involved.  In  the  following  context,  and 
without  the  loss  of  generality,  we  assume  that  the  kernels  are 
symmetric  functions  in  their  arguments  since  it  can  be  seen  that  the 
output  y(l)  would  be  identical  for  any  permutation  of  the  arguments. 
The  Walsh  transform  of  (1)  is  given  by 

Y(o)  =  =  (  y(f)+(o,  f)  dt 

Jo 

-h0 Mo)  +  I  H,{o,)X(o,)6(o  ®  a,)  do, 

Jo 

+  (  (  W2(0i,  <J2){ X(0|)X (<j2) 

Jo  Jo 

-  E\ X(o,)X (<J2)|}5(a  «  <T|  ®  a2)  do,  do2 

+  (  f  f  Wi(Oi,  o2,  o,){X{o,)X(o2)X(o2) 

Jo  Jo  Jo 

-  *(<7,)£|  Ar(<72)  ,Y<0j)|  -  X(o2)E\X(o2)X(o,)] 

-  X(Oi)E\X(o,)X(o2))} 

■  Ho  ®  o,  ®  o2  ®  oj)  do |  do2  doi  +  •  ■  •  (2) 

where  ' }  signifies  the  Walsh  transform  operator,  ♦(<r,  t)  denotes 
a  generalized  Walsh  function  |8|,  and  H„(o ,,  o2,  ■  •  on)  and  -Y(o) 
are  Walsh  transforms  of  the  mh  order  kernel  and  the  input, 
respectively.  We  refer  to  H„(o i,  o2,  ■  ■  ■ ,  o„)  as  the  nth  order  transfer 
function  in  the  Walsh  sequency  domain.  Note  that  the  nth  order 
transfer  function  has  the  same  symmetry  property  as  that  of  the  nth 
order  kernel.  In  (2),  the  delta  function  6 (o)  is  the  Walsh  transform  of 
the  unit  step  function,  i.e.,  6(o )  =  !"♦(<».  0  dt,  and  it  has  the 
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following  properties  |8|  which  are  similar  to  those  of  the  Dirac  delta 
function 

(  f(t)Mt  *  /,)  di  =/(/,) 

Jo 

A 

Ml  •  / 1 )  dl  -  \,  for  I,  >0. 
o 

To  get  (2)  from  ( I ).  we  have  utilized  the  product  rules  of  generalized 
Walsh  functions  which  are  expressed  as 

♦(0,.  /)+(<>.'.  /)  =  *(0|  ®  o;,  O 

+(<J,  O'Kir  /,)  =  Ko,  1 1  9  I,). 


obtain  (3)-(5)  is  equivalent  to  the  approach  in  which  we  minimize  the 
mean-square  error  between  the  output  of  the  actual  system  and  the 
model  |I2J,  1 1 3).  Therefore,  considering y(t)  in  (I)  as  the  estimate  of 
the  desired  signal  and  x{t)  as  the  signal  immersed  in  noise,  the 
transfer  functions  developed  in  this  paper  would  specify  the  nonlinear 
filter  which  is  optimum  in  the  mean-square  sense  [13).  [14). 
Although,  due  to  the  lack  of  a  simple  relationship  between  arithmetic 
convolution  and  logical  convolution,  the  existence  of  natural  systems 
that  can  be  modeled  by  ( I )  is  open  to  question,  we  can  extend  the 
results  developed  in  this  paper  so  as  to  design  an  optimal  NLDI  filter 
which  can  be  used  as  an  alternative  to  a  time-invariant  filter.  The 
implementation  as  well  as  the  design  of  such  filters  in  the  Walsh 
sequency  domain  would  be  very  efficient. 


By  averaging  (2)  and  then  integrating.  /t„  is  readily  obtained  such 
that 


£ |  Y(a) |  da. 


Since  we  assume  that  the  transfer  functions  are  symmetric 
functions  of  their  arguments,  other  transfer  functions  up  to  order  n, 
i.e.,  0;,  a„).  may  be  obtained  by  multiplying  (2)  by 

A'fer )A"{cr )  ■  •  •  X(a^  ),  respectively,  and  then  taking  an  expected 
value  of  each  side.  Because  of  the  Gaussian  nature  of  the  input 
process  and  the  orthogonality  of  the  functional  series  in  (2),  only  one 
term  containing  H„(o\ ,  •  •  • ,  a„)  remains  in  calculating  £[  Y(a)X(a  ' ) 
•  •  -  X(a'n  )).  In  particular,  the  transfer  functions  up  to  third  order  are 
given  as  follows: 


H,(o)  = 


r  ,Ao) 

rr(o) 


(3) 


HAo 


i 


q;)-6urt(g[)6(0|  9  o2) 

r*(<j,)rr(02) 


(4) 


Hi(o  i. 


rrw(0i,  02,  qj) 

r,(«t)rT(o2)rJr(0J) 


Hiio\) 

r  v(ctj) 


6 (g2  9  0,) 


rx(o,) 


6(g|  ®  gj)  + 


W|(g3) 

fr(g2) 


S(a,  9  g2) 


)] 


(5) 


In  (3)-(5),  r,(g)  is  the  Walsh  sequency  power  spectral  density 
function  of  x(t),  and  x(g|,  g2.  •  ■  • ,  a„)  is  the  nth  order  Walsh 
cross  spectrum  such  that  r,(g)6{g  ®  g|)  =  £[  X(a)X(ai)],  and 
IV  ,(0i,  g2.  o„)Mo,  ®  g2  ®  ■  •  ®  g„  ®  g)  = 

£(  T(g)A'(g,)A'(g2)  •  •  •  A'fg,))  Note  that  the  nth  order  Walsh 
spectrum  can  be  defined  as  the  Walsh  transform  of  the  nth  order 
logical  correlation  function,  and,  in  general,  it  is  not  directly  related 
to  the  Fourier  spectrum.  In  the  case  of  the  second-order  moment,  the 
relationship  between  discrete  Walsh  and  Fourier  power  spectra  and 
their  computation  was  discussed  in  (9),  and  the  algorithmic  properties 
of  logical  and  arithmetic  autocorrelation  functions  were  investigated 
in  (10). 

We  can  see  that  the  resulting  expressions  for  the  kernels  up  to  third 
order  have  much  the  same  form  as  those  obtained  in  the  Fourier 
frequency  domain  [4],  except  that  they  are  expressed  in  terms  of 
Walsh  instead  of  Fourier  spectra.  This  result  implies  that  we  may  take 
advantage  of  the  computation  procedure  with  which  we  are  familiar 
in  Fourier  analysis  [II];  that  is,  the  nth  order  transfer  function 
H„(a{,  g2,  • g„)  can  be  obtained  from  the  raw  input  and  output 
time-series  data  by  computing  the  appropriate  power  spectra  using 
the  fast  Walsh  transform  algorithm. 

Finally,  we  note  that  the  statistical  approach  used  in  this  paper  to 
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